Several readers asked me whether they could use AI to calculate taxes after reading my previous post AI Gives Better Answers Than Google. They want a tax projection to help with setting tax withholding, paying estimated taxes, or planning for Roth conversions.
My first reaction was that calculating taxes isn’t the best use of AI, because it falls under “verifiable facts” and “latest development” categories. The retirement plan contribution limits, tax brackets, the maximum deduction amounts, etc., are all verifiable facts. The IRS sets them every year, and they are what they are. Just go to the IRS website for the latest numbers or Google. If you ask AI, it had better get the latest numbers online anyway, because AI’s training lags.
On the other hand, tax rules are complicated. Even when you have all the latest numbers from the IRS, you still need to apply the complex rules — what counts and what doesn’t, and which rates apply to which income. Online tax calculators usually don’t cover the current year until late in the year, and they can be either too simple or too complicated. It would be nice to have AI calculate taxes specifically for the types of income and deductions we have: not too simple with only limited inputs, and not too complicated with everything under the sun.
The Quiz
I thought I would test how well AI calculates taxes. I came up with this quiz question:
Jill, single, age 63, has these incomes in 2026:
- $30,000 from Social Security
- $10,000 from interest and pre-tax IRA withdrawals
- $37,000 from qualified dividends and long-term capital gains
- $1,000 from muni bond interest
Jill contributes $4,400 to her HSA and donates $1,500 in cash to charities. What’s Jill’s federal income tax in 2026?
I designed this question carefully to cover several calculations. Less than 85% of Social Security is taxable. The income consists of ordinary income, tax-exempt income, and investment income taxed at preferential rates. Ordinary income goes across two tax brackets. So does investment income. One deduction is above-the-line, and the other is below-the-line.
I added these instructions to encourage AI to get the latest tax numbers from the IRS:
It’s important to calculate it accurately. Please use tax amounts only for 2026 and only from official IRS sources.
Initial Results
I sent the quiz to all four major AI chatbots. I only used the free version in each one, with the free Thinking, Pro, or Expert mode enabled. I compared the answers to the correct result from my calculator in Calculator: How Much of My Social Security Benefits Is Taxable?
| Correct Answer | $1,640 |
| ChatGPT 5.4 Thinking Mini | $325 |
| Gemini 3 Pro | $1,910 |
| Claude Sonnet 4.6 Extended | $1,910 |
| Grok 4.20 Expert | $1,910 |
My quiz was too hard! All four major AI models failed to give the correct answer. The $1,500 cash donation threw them off. It falls squarely in the “latest development” category. The charity donation deduction for non-itemizers is new. The AI models all have “donations are deductible only when you itemize” imprinted in their training. Gemini, Claude, and Grok all would have given the correct answer if I hadn’t included the cash donation. ChatGPT was off on how much of Social Security is taxable.
When you see AI not giving you a deduction, you can point it out. I followed up with this hint:
I heard that cash donations are deductible for people using the standard deduction, starting in 2026, up to a limit.
I also added this for ChatGPT:
Another AI model calculated a different amount for how much Social Security is taxable. Please double-check your calculation to see who’s right.
Round 2
All chatbots double-checked and revised their answers.
| Correct Answer | $1,640 |
| ChatGPT 5.4 Thinking Mini | $1,640 ✅ |
| Gemini 3 Pro | $1,411 |
| Claude Sonnet 4.6 Extended | $1,640 ✅ |
| Grok 4.20 Expert | $1,411 |
ChatGPT and Claude got the correct answer. Gemini and Grok treated the cash donation deduction as above-the-line and used it to reduce the Social Security taxable amount.
I asked Gemini and Grok to clarify which item should be included in the calculation and which shouldn’t be. Gemini didn’t realize its mistake. Grok refused to answer because I reached the message limit for not having an account.
Here are the full chat transcripts if you’re interested:
https://chatgpt.com/share/69bf73fb-db44-8004-9905-b77b56fbb6ce
https://gemini.google.com/share/795bdad9f5ad
https://claude.ai/share/ef62408a-6502-4d70-82be-6ea46a4a4cc3
[No sharable link for Grok because I don’t have an account.]
Impressions
I don’t think we can draw definitive conclusions based on only one test. I call these impressions.
1. Whether AI can calculate taxes correctly depends on the complexity. 3 out of 4 major chatbots would’ve given the correct answer in one go if the question didn’t include the new and tricky charity donation deduction.
2. AI works better when you continue the conversation. Both ChatGPT and Claude reached the correct answer after I asked them to double-check.
3. It takes little effort to send the same question to two or more AI models. You don’t need to know which AI is right when you get different answers. You can tell one AI that another AI said something different. It’ll take a second pass and re-examine.
4. AI was wrong because human sources were wrong. Gemini and Grok treated the donation deduction as above-the-line because many human sources incorrectly called it above-the-line. If you Googled, you would encounter those incorrect sources too. ChatGPT and Claude used more reliable sources and correctly identified the difference between above-the-line and not requiring itemizing deductions.
5. An online tax calculator is faster and more accurate if you know which one to use. That’s a big “if.” Only saying “Don’t use AI because it’s often wrong” doesn’t say where you will find more reliable sources.
As much as I’d like to see everyone use my tax calculator, let’s face it: 99.99% of people don’t know that I exist. They have little chance to find the few sources that do it accurately because Google doesn’t rank tax calculators by accuracy. Short of finding the good sources, the results from AI aren’t that bad (see #1).
6. Even when AI was wrong in some nuances, its general approach was correct. All 4 AI chatbots followed these correct steps:
- Calculate how much of Social Security is taxable
- Calculate AGI
- Calculate taxable income
- Separate ordinary income and preferential investment income, and know how they stack
- Apply different tax brackets
These are what I call “common knowledge of an insider.” You can learn these steps from AI if you’re not familiar with tax calculations. That’s more valuable than just having a final number. An online tax calculator gives you a number but doesn’t explain the steps. AI is a better tool if you’re more interested in learning how to fish than getting a fish.
Learn the Nuts and Bolts
I put everything I use to manage my money in a book. My Financial Toolbox guides you to a clear course of action.

Leave a Reply