The Hidden Cost of Using ChatGPT with Client Data
ChatGPT privacy risks are real. Learn how client data entered into AI tools can be exposed, retained, and used for training — and what regulated professionals must do now.
Every Prompt Is a Disclosure
Every day, thousands of attorneys, financial advisors, therapists, and healthcare administrators paste confidential client data into ChatGPT. They draft memos with real patient names, summarize case files with actual Social Security numbers, and ask AI to analyze financial records containing account details — all without realizing that every single prompt is a potential data disclosure to a third party.
These ChatGPT privacy risks are not hypothetical. They are structural. The moment you type a client's name, diagnosis, or legal matter into a consumer AI tool, that data leaves your control. It travels to servers you don't own, is processed by systems you can't audit, and may be retained in ways that violate the very regulations your profession is built on.
This article is the one you forward to your managing partner, your compliance officer, or your IT department. Because the cost of ignoring this problem is not theoretical — it is measured in regulatory fines, malpractice liability, and destroyed client trust.
What Actually Happens to Your Data Inside ChatGPT
Most professionals assume that ChatGPT works like a search engine: you ask a question, you get an answer, and your query disappears. That assumption is dangerously wrong.
Here is what actually happens when you enter client data into major AI platforms:
Data Retention
OpenAI's privacy policy, updated in 2025, states that user inputs and outputs may be retained for up to 30 days for trust and safety purposes, even when chat history is turned off. For users on the free and Plus tiers, conversations are retained indefinitely unless manually deleted. That means a prompt containing a patient's HIV status or a client's pending merger details may sit on OpenAI's servers for a month — or forever.
Model Training
Prior to OpenAI's opt-out changes in 2023, all user inputs were eligible for model training by default. While business and enterprise tiers now exclude training by default, free and Plus tier users must manually opt out through a settings toggle that most people never find. Google's Gemini follows a similar pattern: conversations on the free tier may be reviewed by human annotators and used for product improvement.
Third-Party Access
Under OpenAI's terms of service, the company reserves the right to share data with "service providers" and may disclose information to comply with legal obligations. In practice, this means a subpoena directed at OpenAI could surface your client's data — data that you voluntarily handed over.
Human Review
OpenAI's documentation confirms that human reviewers may examine conversations flagged by automated systems. Anthropic, Google, and other providers have similar review processes. A healthcare administrator who types a patient complaint containing detailed medical history is potentially exposing that information to an unknown human reviewer at a tech company, with no Business Associate Agreement (BAA) in place.
Real-World Scenarios: How Data Leaks Actually Happen
These are not edge cases. They are patterns we see repeatedly across regulated industries.
Scenario 1: The Healthcare Administrator
A hospital's administrative coordinator receives a patient complaint that includes the patient's full name, date of birth, diagnosis, treating physician, and medication list. Overwhelmed by the volume of complaints, she pastes the entire text into ChatGPT and asks it to draft a professional response letter. She does this daily. Over six months, she has entered protected health information (PHI) for hundreds of patients into a system with no Business Associate Agreement, no HIPAA safeguards, and no audit trail. The hospital has no idea this is happening.
Scenario 2: The Junior Associate
A second-year associate at a mid-size law firm is assigned to draft a motion in a complex commercial litigation matter. Under time pressure, he pastes the opposing party's confidential settlement offer — marked "FOR SETTLEMENT PURPOSES ONLY" — into ChatGPT to help restructure his arguments. That settlement figure, along with the names of the parties and the details of the dispute, now resides on OpenAI's servers. Under the Heppner ruling of February 2026, this disclosure to a third party may have waived attorney-client privilege over the entire communication chain.
Scenario 3: The Financial Advisor
A wealth management advisor uses ChatGPT to prepare quarterly portfolio review summaries for high-net-worth clients. Each prompt includes the client's name, account balances, investment positions, risk tolerance notes, and tax situation. She copies the AI-generated summary into a client-facing report. She has never read OpenAI's data retention policy. She does not know that her clients' complete financial profiles have been disclosed to a third party without consent, potentially violating SEC Regulation S-P and state financial privacy statutes.
Scenario 4: The Therapist
A licensed clinical psychologist uses ChatGPT to help draft progress notes after sessions. He enters session summaries that include the patient's name, presenting concerns, trauma history, substance use details, and treatment plan. This is a direct transmission of psychotherapy notes — the most protected category of information under HIPAA — to a commercial AI provider, with no BAA and none of the safeguards HIPAA requires. A single breach report could end his career.
The Regulatory Landscape: What the Law Actually Says
The regulatory consequences of AI data privacy failures are not speculative. They are already being enforced.
HIPAA (Health Insurance Portability and Accountability Act)
HIPAA requires covered entities and business associates to implement safeguards for protected health information. Using ChatGPT to process PHI without a signed Business Associate Agreement is a clear HIPAA violation. OpenAI does not sign BAAs for consumer products. Civil penalties range from $141 per violation to an annual cap of $2,134,831 per violation category, with criminal penalties of up to $250,000 and 10 years' imprisonment for knowing misuse.
The HHS Office for Civil Rights issued guidance in 2024 explicitly warning that AI tools processing PHI must comply with the HIPAA Security Rule, including access controls, audit logging, and encryption standards that consumer AI tools do not meet.
GDPR (General Data Protection Regulation)
The Italian data protection authority (Garante) temporarily banned ChatGPT in 2023 over GDPR concerns, and multiple European DPAs have opened investigations. Under GDPR, entering an EU data subject's personal data into ChatGPT may constitute an unauthorized international data transfer, a failure to provide required transparency notices, and processing without a lawful basis. Fines can reach 4% of global annual revenue or 20 million euros, whichever is higher.
State Privacy Laws
California's CCPA/CPRA, Virginia's CDPA, Colorado's CPA, and a growing patchwork of state laws impose obligations on businesses that process personal information. Many of these laws require specific disclosures about third-party data sharing — disclosures that almost no professional makes when pasting client data into ChatGPT.
The Heppner Ruling (February 2026)
In United States v. Heppner, federal Judge Jed Rakoff held that documents generated using consumer AI tools are not protected by attorney-client privilege when the attorney disclosed confidential client information to the AI provider. The court reasoned that voluntary disclosure to a third party — the AI provider — waives privilege under established doctrine. This ruling has immediate implications for every attorney using consumer AI tools with client data. State bar associations across the country have since issued emergency ethics guidance, with at least 14 states now requiring disclosure of AI tool usage in engagement letters.
The Numbers That Should Keep You Up at Night
- 68% of knowledge workers in regulated industries report using generative AI tools at work, according to a 2025 McKinsey survey.
- 44% of those users admit to entering client, patient, or customer data into AI tools, per a 2025 Cisco Data Privacy Benchmark Study.
- Only 11% of organizations have formal AI usage policies that address data privacy, according to ISACA's 2025 AI Governance report.
- The average cost of a data breach in healthcare reached $10.93 million in 2025, per IBM's Cost of a Data Breach Report.
- 37% of employees who use AI at work say they have entered information their employer would consider sensitive, per a Salesforce Generative AI Snapshot survey.
The gap between adoption and governance is enormous — and it is your liability.
What to Do Right Now
If you work in a regulated industry and have ever entered client data into ChatGPT, Claude, Gemini, or any other consumer AI tool, take these steps immediately:
- Stop entering identifiable client data into consumer AI tools today. This is not an overreaction. It is the minimum standard of care.
- Conduct an AI usage audit. Survey your team — associates, paralegals, administrative staff, analysts — to understand who is using AI, which tools they are using, and what data they are entering. The answers will likely surprise you.
- Review your data retention settings. Log into every AI tool your team uses, check the retention settings, delete stored conversations that contain client data, and turn off model training toggles where available.
- Update your policies. Draft or update your organization's AI acceptable use policy. Specify which tools are approved, which data categories are prohibited, and which safeguards are required.
- Implement privacy-first AI tools. Switch to AI platforms that de-identify data before it leaves your environment. Tools like PrivacyFrom.AI automatically strip names, dates, account numbers, medical record numbers, and other PII entity types (more than 50 in total) before any data reaches the AI model.
- Update client disclosures. If you use AI in client-facing work, update your engagement letters, privacy notices, and informed consent forms. Post-Heppner, transparency about AI usage is not optional — it is an ethical obligation.
- Train your entire organization. Privacy training should cover not just your professional staff but every person with access to client data, including administrative and support roles.
Why De-Identification Is the Only Structural Solution
Policy alone does not solve this problem. You can tell people not to paste client data into ChatGPT, but they will do it anyway — because the productivity gains are too compelling and the risk feels abstract.
The only structural solution is to remove the identifiable data before it ever reaches the AI model. This is what de-identification does:
- Names become reversible tokens: "John Martinez" becomes "[PERSON_1]"
- Dates are tokenized: "March 5, 2026" becomes "[DATE_1]"
- Account numbers, SSNs, medical record numbers are replaced with placeholder tokens
- Addresses, phone numbers, email addresses are all stripped automatically
The AI model processes the clean, de-identified text and returns its response. PrivacyFrom.AI then re-inserts the original details on your device — so you get the full, usable output without ever having disclosed a single piece of identifiable information to a third party.
This is not a workaround. It is the architecture that privacy-first AI requires.
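For readers who want to see the mechanics, here is a minimal sketch of the tokenize-and-restore round trip in Python. It is an illustration of the architecture, not PrivacyFrom.AI's implementation: the three regex patterns and the token format are simplified assumptions, and a production system would rely on trained models rather than regexes to catch names, dates, and the long tail of indirect identifiers.

```python
import re

# Minimal illustration of reversible tokenization. A production tool
# would use trained detection models covering 50+ entity types, not
# three regexes.
PATTERNS = {
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "PHONE": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
}

def deidentify(text):
    """Replace each detected identifier with a reversible token.

    Returns the cleaned text plus a token map that stays on the
    user's device and never reaches the AI provider.
    """
    token_map = {}
    counters = {}

    def make_sub(entity):
        def sub(match):
            counters[entity] = counters.get(entity, 0) + 1
            token = f"[{entity}_{counters[entity]}]"
            token_map[token] = match.group(0)
            return token
        return sub

    for entity, pattern in PATTERNS.items():
        text = pattern.sub(make_sub(entity), text)
    return text, token_map

def reidentify(text, token_map):
    """Re-insert the original values into the model's response, locally."""
    for token, original in token_map.items():
        text = text.replace(token, original)
    return text

prompt = "Follow up with the patient at 555-867-5309 re: SSN 123-45-6789."
clean, mapping = deidentify(prompt)
print(clean)
# -> "Follow up with the patient at [PHONE_1] re: SSN [SSN_1]."
# Only `clean` is sent to the model; `mapping` never leaves the device.
model_response = "Draft: We attempted to reach you at [PHONE_1]."
print(reidentify(model_response, mapping))
# -> "Draft: We attempted to reach you at 555-867-5309."
```

The design point is the token map: because it never leaves the device, the AI provider sees only placeholders, while the user still receives a fully usable, re-identified response.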
Frequently Asked Questions
Is ChatGPT HIPAA compliant?
No. ChatGPT's consumer products (Free and Plus tiers) are not HIPAA compliant. OpenAI does not sign Business Associate Agreements for these products, does not provide the audit controls required by the HIPAA Security Rule, and retains user inputs in ways that are incompatible with HIPAA's minimum necessary standard. OpenAI's Enterprise and API products offer BAAs and enhanced data controls, but the vast majority of healthcare professionals using ChatGPT are on consumer tiers without these protections. Any use of consumer ChatGPT to process protected health information is a potential HIPAA violation carrying penalties of up to $2.13 million per violation category per year.
Can I use ChatGPT safely if I remove names manually before typing?
Manual de-identification is better than no de-identification, but it is unreliable and insufficient. Studies consistently show that humans miss identifiers when redacting manually — particularly indirect identifiers like dates of service, rare diagnoses, job titles, and geographic details that can be combined to re-identify individuals. A 2024 study published in the Journal of the American Medical Informatics Association found that manual redaction missed an average of 17% of identifiers in clinical notes. Automated de-identification tools like PrivacyFrom.AI detect and replace 50+ entity types consistently, without the fatigue and error rates that make manual approaches risky.
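To see why automated detection matters, consider what even a general-purpose named-entity recognizer finds in a single sentence of a clinical note. The sketch below uses spaCy's small English model purely as a stand-in; it is not the detector PrivacyFrom.AI uses, and its exact output varies by model version.

```python
import spacy  # pip install spacy; python -m spacy download en_core_web_sm

nlp = spacy.load("en_core_web_sm")

note = ("Seen March 5 at the Riverside clinic: a 54-year-old air "
        "traffic controller from Cedar Falls reporting a relapse.")

# A manual redactor who strips only the patient's name leaves all of
# these indirect identifiers behind.
for ent in nlp(note).ents:
    print(f"{ent.label_:>10}  {ent.text}")
# Typical output (varies by model version):
#       DATE  March 5
#        GPE  Cedar Falls
#       DATE  54-year-old   (age may surface under several labels)
```

Note what even this model misses: the job title. General NER models have no concept of occupation as PII, which is exactly why dedicated de-identification tools cover 50+ entity types rather than the handful a general-purpose model knows.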
What is the Heppner ruling and how does it affect AI use?
In February 2026, federal Judge Jed Rakoff ruled in United States v. Heppner that documents generated using consumer AI tools are not protected by attorney-client privilege. The court held that when an attorney enters confidential client information into a third-party AI system, that constitutes a voluntary disclosure to a third party, which waives privilege under established legal doctrine. The ruling affects any attorney who has used ChatGPT, Claude, Gemini, or similar tools with client data. Since the ruling, at least 14 state bar associations have issued emergency ethics guidance, and many now require attorneys to disclose AI tool usage in engagement letters. The practical impact extends beyond law: the ruling establishes that sending confidential data to an AI provider is legally equivalent to sharing it with any other third party.
Does turning off chat history in ChatGPT protect my data?
No. Turning off chat history in ChatGPT prevents your conversations from appearing in the sidebar and excludes them from model training, but OpenAI still retains the data for up to 30 days for "trust and safety" purposes. During that retention window, your data remains on OpenAI's servers and could be accessed by human reviewers, disclosed in response to legal process, or exposed in a security breach. Disabling chat history is a minimal step, but it does not constitute meaningful data protection for regulated information. The only way to ensure client data is not retained by an AI provider is to prevent identifiable data from reaching the provider in the first place, through automated de-identification before transmission.
The professionals who take AI privacy seriously today will be the ones still standing when the enforcement wave arrives. The ones who don't will be case studies in what not to do.
Your clients trust you with their most sensitive information. That trust does not have an exception for convenience.
Start protecting client data now — see how PrivacyFrom.AI de-identifies every prompt before it ever leaves your device.