Lesson 10: Local AI Models: The Compliance Shortcut and Its Limits
← Study Guide home · Lesson 10 of 12
The most important emerging topic with no formal IRS guidance
Learning Objectives
After completing this lesson, you will be able to:
- Explain what "local AI" means and how it differs from cloud AI
- Articulate the full legal argument for why local AI may not constitute a "disclosure" under §7216
- Identify the potential counterarguments and evaluate their strength
- Describe what "genuinely local" requires and what it does not eliminate
- Apply the risk comparison framework to local vs. cloud AI scenarios
What "Local AI" Means
"Local AI" refers to AI models where:
- The model weights run on hardware you control: your firm's server, workstation, or laptop
- Inference (processing) occurs on your machine: no query goes to an external server
- Data does not leave your network: prompts, client data, and outputs remain within your infrastructure
This is distinct from:
- Cloud AI (ChatGPT, Claude, Gemini): Your prompt is sent to the vendor's servers; inference happens there; results are returned to you.
- Hybrid/semi-local AI: A tool that runs on your hardware but "phones home" for updates, logs, telemetry, or fallback processing, these are not genuinely local.
Common local AI platforms for practitioners:
- Ollama: Open-source platform for running LLMs locally; supports many models
- LM Studio: User-friendly local model runner with a graphical interface
- GPT4All: Open-source local LLM interface
Common local models:
- Llama 3.1/3.2/3.3 (Meta): Available in sizes from 8B to 70B+ parameters
- Mistral 7B/12B (Mistral AI): Strong performance for size; good for document analysis
- Phi-4 (Microsoft): Small, efficient, runs on modest hardware
- Gemma 2 (Google): Available for local deployment
The §7216 Legal Argument in Full
Here is the complete step-by-step argument for why genuinely local AI does not trigger §7216's disclosure prohibition:
Step 1: IRC §7216(a) prohibits a tax return preparer from "disclos[ing] any information furnished to him for, or in connection with, the preparation of any such return."
Step 2: The definition of "disclosure" in Treas. Reg. §301.7216-1(b)(5) is: "the act of making tax return information known to any person in any manner whatever."
Step 3: The definition turns on the word "person." IRC §7701(a)(1) defines "person" as: "an individual, a trust, estate, partnership, association, company, or corporation." This list is exhaustive for purposes of the Code.
Step 4: Software model weights running on your hardware are none of these. A local AI model is:
- Not an individual (not a human being)
- Not a trust or estate
- Not a partnership, association, company, or corporation
- Not a separate legal entity of any kind
Step 5: Therefore, running a prompt containing client tax data through a local AI model does not make that information "known to any person" within the meaning of §301.7216-1(b)(5).
Step 6: Therefore, the disclosure prohibition is not triggered, and §301.7216-3 consent is not required for the local AI analysis itself.
This is a textual, plain-language reading of existing regulations. It does not rely on any IRS guidance, revenue ruling, or interpretation that is specific to AI, because none exists. It applies the definitions Treasury established in 2009/2012 to a technology that did not exist then.
The Potential Counterarguments and Why They Are Weak
Counterargument 1: "The model company is a person who created the model weights."
This argument would say that because Anthropic, Meta, or Microsoft created the model weights, and those companies are "persons" under §7701(a)(1), using their model constitutes making information known to them.
Why it is weak: The disclosure definition requires making information known to a person. The company that distributed the model weights does not "know" what you do with them after download, particularly if there is no telemetry or reporting back. The act of downloading software that a company created and then using that software on your local hardware does not make information known to the company that wrote the software. Under this argument, using Microsoft Word would constitute a disclosure to Microsoft every time you drafted a client engagement letter. That cannot be the correct reading.
Counterargument 2: "The cloud infrastructure that supports local model distribution is a 'person.'"
This argument notes that even "local" models may initially be downloaded from a cloud repository and may receive updates over the internet.
Why it is weak: The one-time download of model weights from a repository, with no client data transmitted during that download, does not constitute a disclosure. The analysis under §301.7216-1(b)(5) turns on what data is transmitted, not what software was obtained from whom. If no client data ever traverses the network connection to the model vendor, no disclosure occurs.
What "Genuinely Local" Requires
For the local-model §7216 argument to hold, the following must be true:
-
No telemetry with prompt content: The model must not send prompt text, completions, or any content back to the model provider. Many models are entirely local by default; confirm this for any model you deploy.
-
No cloud fallback: The model must not automatically send requests to a cloud endpoint if local processing is slow or fails. Some tools have optional or default cloud fallback, disable it.
-
No hosted RAG (Retrieval-Augmented Generation): If you are using a retrieval-augmented system where documents are uploaded to a cloud vector database, the client data leaving for the cloud server is a disclosure. Local RAG implementations (where the vector database runs on your hardware) do not have this problem.
-
No remote OCR: If OCR (optical character recognition) of scanned documents is handled by a cloud service, those documents, including client data, are being disclosed to that service.
-
Firm network control: The inference must occur on hardware that is within your firm's control, not on a shared network resource operated by a third party.
✅ COMPLIANCE NOTE: Before deploying a "local" AI tool, run it with network monitoring active (e.g., Wireshark or a similar tool) to confirm that no data leaves your network during normal operation. The vendor's documentation is a starting point, but verification is better practice.
What Local Models Do NOT Eliminate
Being on a local model removes the §7216 disclosure concern (under the analysis above) but does not eliminate other compliance obligations:
| Obligation | Does Local AI Eliminate It? | Why |
|---|---|---|
| §7216 use prohibition | No | Using client data for purposes other than return preparation (e.g., training a custom model) is still a "use" violation |
| WISP endpoint controls | No | Your WISP must address AI tool security regardless of deployment model |
| §10.22 due diligence | No | You still must review AI outputs and verify accuracy |
| SSTS §1.4 member responsibility | No | You remain responsible for positions taken on the return |
| §10.35 competence | No | You must understand the local model tool well enough to use it competently |
| §10.36 firm procedures | No | Adequate AI use procedures are required regardless of local vs. cloud |
| ET §1.700.001 | Potentially No | The AICPA confidentiality threat/safeguards analysis must be applied; if the model is genuinely controlled by your firm, threats may be reduced to acceptable levels, but this must be analyzed |
| Georgia Rule 20-12-.11 | Potentially No | May be satisfied if the firm controls the model and data does not leave firm control |
The Honest Guidance Gap Statement
⚠️ GUIDANCE GAP: As of June 2026, no revenue ruling, notice, FAQ, or Chief Counsel memorandum addresses whether local or cloud AI constitutes a "disclosure" under §7216. The local-model conclusion rests entirely on applying 2012-era regulations by their plain terms, software is not a "person" under §7701(a)(1). That is a sound and textually defensible argument, and the IRS has not asserted the contrary. But it does not have IRS blessing, because it does not have IRS blessing. No one should represent to clients, colleagues, or regulators that the IRS has approved or endorsed local-model use as outside §7216. Confirm any reliance on this analysis with counsel or your E&O carrier before building your practice around it.
Risk Comparison Table
| AI Deployment | §7216 Disclosure? | Consent Needed? | WISP Obligation? | Best for... |
|---|---|---|---|---|
| Consumer cloud AI (ChatGPT Free/Plus) | Yes, disclosure occurs | Yes (if no exception) | Yes | General tasks only; NOT for client data |
| Enterprise cloud AI with DPA (e.g., Copilot, ChatGPT Enterprise) | Likely yes, disclosure occurs | Yes, but auxiliary exception possible if non-substantive | Yes, including vendor oversight | Cloud AI + proper paperwork path |
| Cloud AI with enterprise DPA + valid §7216 consent | Disclosure occurs but authorized | Consent already obtained | Yes | Full compliance path with cloud AI |
| Genuinely local AI (no telemetry, no cloud fallback) | Likely no, no "person" receives data | Not triggered under local analysis | Yes | Best §7216 position, but other obligations remain |
Practical: Hardware and Tools
What hardware do you need for local AI?
For models in the 7B–13B parameter range (capable of meaningful tax research support):
- RAM: 16GB minimum for 7B models; 32GB recommended for 13B models
- GPU: A modern NVIDIA GPU with 8GB+ VRAM accelerates inference significantly; CPU-only is possible but slower
- Storage: 4–10 GB per model (quantized versions)
- Operating System: Windows, macOS (Apple Silicon particularly efficient), or Linux
Practical capability for tax practice:
Local models at the 7B–13B parameter level are capable of:
- Drafting client letters and memos (with your review)
- Summarizing documents and source materials
- Formatting and organizing data
- Answering general research questions (with verification)
Local models at this size are not as capable as the best cloud models (GPT-4, Claude 3.7 Sonnet, Gemini 2.0 Ultra) for complex multi-step legal reasoning. The compliance benefit of local deployment comes with a capability tradeoff.
📌 PRACTICE TIP: A reasonable approach for a small firm: use a genuinely local model for all work involving client-specific data (drafting letters, organizing data, summarizing documents), and use cloud AI with appropriate DPAs and consent for research and general tasks that do not involve client-specific information.
Key Takeaways
- "Genuinely local" AI, model weights on your hardware, inference on your machine, no data leaving your network, has a strong textual argument for not constituting a §7216 disclosure, because software running on your hardware is not a "person" under §7701(a)(1).
- The counterarguments (the model vendor is a person) are weak under a plain-text reading of the regulations.
- "Genuinely local" requires no telemetry, no cloud fallback, no hosted RAG, and no remote OCR.
- Local AI eliminates the §7216 disclosure concern but does NOT eliminate WISP obligations, §10.22 due diligence, SSTS member responsibility, or other professional obligations.
- No IRS guidance confirms this analysis. It is sound and defensible, but confirm with counsel or your carrier before relying on it as a compliance strategy.
Quick Review
Q1: You run Ollama on your firm's workstation with the Llama 3.3 70B model. Network monitoring confirms no prompt data leaves your network during inference. You paste a client's full Schedule C data into a local prompt for help drafting a depreciation memo. Is this a §7216 disclosure?
Answer: Under the plain-text legal analysis: No. The information was not made "known to any person" because the model weights running on your workstation are not a "person" under §7701(a)(1). However: (1) this has no IRS guidance confirming it; (2) you must still comply with SSTS §1.4, §10.22, and other professional standards; and (3) confirm with counsel before relying on this as your compliance strategy.
Q2: You use LM Studio to run a local model. You discover that LM Studio has an optional telemetry setting that was enabled by default, which sends usage statistics, but not prompt content, to the LM Studio servers. Does this affect the §7216 analysis?
Answer: If only usage statistics (not prompt content) are sent, the §7216 analysis is unchanged, the client's tax return information has not been made known to another person. However, confirm what "usage statistics" includes in the vendor's documentation. If any portion of the prompt, client name, or identifying information could be included in telemetry data, you have a potential disclosure issue.
Q3: You run a local Ollama instance for drafting and research. However, you use a cloud-hosted vector database service to store and search your client files for retrieval-augmented generation. Is this setup "genuinely local"?
Answer: No. The cloud-hosted vector database is a third-party server receiving your client files. Uploading client files to the vector database is a disclosure under §301.7216-1(b)(5). This setup is not genuinely local from a §7216 perspective.
Q4: True or false: A local AI model eliminates the need for your firm to have AI-related procedures in its WISP.
Answer: False. The WISP must address all AI tools used in the practice, including local models. Security risks associated with local models (endpoint security, access controls, physical security of the workstation, data retention) are all WISP concerns regardless of whether the model is local or cloud-based.
Q5: A local AI model summarizes a client's K-1 incorrectly, attributing ordinary income as capital gain. The CPA does not review the summary carefully and files the return with the incorrect characterization. The CPA's defense is that the local model is not a "person" under §7216, so there was no unauthorized disclosure and therefore no violation. Evaluate this defense.
Answer: The defense misses the point. The §7216 disclosure analysis may support the local model position, but the issue here is not §7216, it is §10.22 (due diligence), SSTS §1.4.4 (member responsibility regardless of tool use), and potentially §10.35 (competence). The local-AI §7216 argument is only relevant to the disclosure element. It does not excuse the practitioner from reviewing AI outputs, verifying accuracy, and applying professional judgment. The CPA has failed to meet professional standards regardless of the §7216 analysis.
← Lesson 9: State Law Layer · Study Guide home · Lesson 11: Putting It All Together →