Capabilities and Limitations
Capabilities and Limitations
Knowing what LLMs can and cannot do helps you set realistic expectations and design better workflows.
What LLMs Do Well
Text Generation and Transformation
- Writing drafts, summaries, and translations
- Reformatting data between structures
- Generating variations of content
Analysis and Extraction
- Identifying patterns in text
- Extracting specific information from documents
- Classifying content into categories
Reasoning (With Caveats)
- Following multi-step logical arguments
- Solving math problems (especially with chain-of-thought prompting)
- Making comparisons and evaluations
Code
- Writing, explaining, and debugging code
- Translating between programming languages
- Generating tests and documentation
What LLMs Cannot Do Reliably
Factual Accuracy
Models can state incorrect information with total confidence. They do not have a "knowledge database" — they predict likely text. Always verify facts, especially numbers, dates, names, and citations.
Real-Time Information
Models have a knowledge cutoff. They do not know about events after their training data ends. They cannot access the internet unless specifically connected to search tools.
Perfect Consistency
The same prompt can produce different outputs each time. This is by design (controlled by temperature settings), but it means you cannot expect identical results from identical inputs.
Long-Form Coherence
Models can lose track of requirements, instructions, or earlier content in very long outputs. Breaking complex tasks into steps improves consistency.
Counting and Precise Math
Models often make errors with exact counting, complex arithmetic, and precise numerical reasoning. Use code execution tools for anything requiring exact math.
The 80/20 Rule of LLMs
For most business tasks, an LLM can get you 80% of the way to a good result very quickly. The remaining 20% — verification, refinement, and edge cases — requires human judgment. Design your workflows around this reality rather than expecting perfection.
Practical Guidelines
| Task Type | LLM Reliability | Best Practice |
|---|---|---|
| Drafting text | High | Use as a starting point, edit for accuracy |
| Data extraction | Medium-High | Verify extracted data against source |
| Factual questions | Medium | Cross-reference with authoritative sources |
| Complex reasoning | Medium | Use chain-of-thought prompting |
| Precise calculations | Low | Use code execution instead |
| Real-time information | None | Connect to search or database tools |