Legal

AI & Data Disclosures

Effective date: March 18, 2026

aclearly uses artificial intelligence to extract, enrich, and analyze document data. This page provides transparency about how AI is used, its limitations, and how your data flows through the system.

1. AI Models Used

Model	Provider	Purpose
GPT-4o	Azure OpenAI Service	Document data extraction — reads uploaded PDFs and images, returns structured JSON with field values and confidence scores
GPT-4o	Azure OpenAI Service	Content enrichment — normalizes vendor names, dates, currencies; generates categories and keywords
GPT-4o	Azure OpenAI Service	Insights — answers natural-language questions about your document data
text-embedding-ada-002	Azure OpenAI Service	Generates vector embeddings for semantic search and duplicate detection

2. How Documents Are Processed

When you upload a document, the following pipeline runs:

Upload — Your file is received over HTTPS and temporarily held in server memory
AI Extraction — The document is converted to base64 and sent to Azure OpenAI GPT-4o with a structured extraction prompt. The model returns JSON containing detected fields, values, and confidence scores
Storage — The original document is stored in encrypted cloud storage (Supabase Storage). The extracted JSON is saved to the database
Embedding — A text summary of extracted fields is sent to Azure OpenAI's embedding model. The resulting vector is stored for semantic search
Enrichment — GPT-4o normalizes and enriches the extracted data (vendor names, dates, categories). This runs asynchronously and does not block the extraction response
Anomaly Detection — The system checks for duplicate documents, unusual amounts, new vendors, and stale invoices using statistical baselines. Alerts are generated when anomalies are detected

3. What Data Is Sent to AI Models

Document Extraction

The full content of your uploaded document (as base64-encoded PDF or image) is sent to Azure OpenAI. This is necessary for the model to read and extract data from the document.

Content Enrichment

Extracted field names and values (not the original document) are sent to Azure OpenAI for normalization and categorization.

Insights Queries

When you ask a question in the Insights interface, your question and relevant extracted text from matching documents are sent to Azure OpenAI to generate an answer.

Embeddings

A plain-text summary of extracted field names and values is sent to Azure OpenAI's embedding model. Only extracted text is sent, not the original document.

4. Your Data Is Not Used for AI Training

aclearly uses Microsoft Azure OpenAI Service, which operates under enterprise data processing terms. Under these terms:

Your data is not used to train, retrain, or improve Azure OpenAI models
Your data is not available to other Azure OpenAI customers
Your data is not used by Microsoft to improve its AI products
Prompts and completions are not stored by Azure OpenAI beyond the API request lifecycle (with abuse monitoring opt-out applied)

See Microsoft's Azure OpenAI data privacy documentation for details.

5. AI Accuracy and Limitations

AI-powered extraction is not perfect. Important limitations include:

Accuracy varies by document quality — handwritten text, poor scans, unusual layouts, and non-English documents may produce lower accuracy
Confidence scores are estimates — a high confidence score does not guarantee correctness
Field detection is probabilistic — the model may miss fields, hallucinate values, or misinterpret ambiguous content
Enrichment is AI-generated — normalized vendor names, categories, and summaries are generated by AI and may contain errors
Insights are grounded but not infallible — answers are based on your document data, but the AI may misinterpret or miscalculate when synthesizing across multiple documents

You should always review and validate extracted data before relying on it for financial, legal, or operational decisions.

6. Automated Decision-Making

The Service performs automated analysis that may generate alerts or recommendations:

Duplicate detection — documents with >95% semantic similarity to existing documents are flagged
Amount anomaly detection — invoice amounts >2 standard deviations from a vendor's historical average are flagged
New vendor detection — first-time vendors are flagged for awareness
Stale invoice detection — invoices dated more than 6 months ago are flagged

These are informational alerts only. No automated decisions are made that produce legal or similarly significant effects. You retain full control over all business decisions.

Under GDPR Article 22, you have the right to request human review of any automated processing. Contact privacy@aclearly.com.

7. AI-Related Data Retention

Data Type	Retention
Original documents	Until you delete them or close your account
Extracted JSON	Until you delete the extraction or close your account
Vector embeddings	Until you delete the extraction or close your account
Enrichment data	Until you delete the extraction or close your account
Anomaly alerts	Until dismissed or account closure
Vendor baselines	Until account closure
Data sent to Azure OpenAI	Not retained by Azure after API response

8. Opting Out

AI processing is core to the Service. You cannot use aclearly without AI-powered extraction. However:

You can delete any document and its associated extraction, embedding, and enrichment data at any time
You can dismiss anomaly alerts
You can close your account and all data will be permanently deleted within 30 days
You can request a full data export before deletion by contacting privacy@aclearly.com

9. Changes to AI Systems

We may update the AI models, prompts, or processing pipeline used in the Service. If we make material changes to how your data is processed by AI (such as switching providers or adding new categories of AI processing), we will notify you via email at least 30 days before the change takes effect.

10. Questions

For questions about AI processing, data handling, or these disclosures:
privacy@aclearly.com