Earnings calls and financial document analysis: institutional AI on a meter
· Avery NXR
Institutional finance has been quietly transformed by AI in the past few years.
Equity research teams use AI to summarize every earnings call across their coverage universe. Corporate finance teams use AI to parse SEC filings, analyst reports, and competitor disclosures. Asset managers use AI to scan macroeconomic releases, central bank communications, and policy documents. Investment banks use AI to read every M&A announcement, every credit memo, every deal document that crosses their pipeline.
The volume is industrial. The data is sometimes the most sensitive in the firm. The bill is, in the cloud-LLM-default architecture, larger than people realize.
The volume problem
A representative equity research desk covers a hundred to three hundred companies. Each company in the coverage universe produces a substantial volume of analyzable content per quarter: quarterly earnings call (an hour of dialogue, fifty pages of transcript), 10-Q filings (a hundred pages of structured disclosure), analyst reports from other firms, news releases, conference presentations.
Across the coverage universe, this is thousands of long documents per quarter. The AI workflow on each document is non-trivial: read, summarize, extract structured data (revenue beats/misses, guidance changes, competitive mentions), flag interesting quotes, generate research-note drafts.
A reasonable token budget per document is fifty thousand input tokens (the document plus context) and three thousand output tokens. At frontier pricing, about $0.20 per document. Across two thousand documents per quarter, that's $400 per quarter — small.
But this is the analyst-facing layer. The data-and-signals layer underneath, which runs across many more documents at higher frequency, is where the volume gets serious. A quant team running AI across every SEC filing for a broad equity universe is processing tens of thousands of documents per month, with much higher token counts. The bill at a serious quant shop is in the high six figures per year.
For corporate development teams at large public companies, the workload is smaller but the privacy stakes are higher. Reading competitor disclosures, M&A target analyses, board memos — the volume is in the hundreds per month, the bill is in the low five figures per year, but the documents are confidential in ways that cloud-LLM-default architectures struggle to defend.
Why financial analysis is a strong local-SLM workload
The properties are all present, with several that are unusually strong in finance.
The work is narrow in domain. Financial language, accounting terminology, sector-specific patterns — all of this is specialized. A model fine-tuned on financial documents will outperform a general model on financial work.
The work is repetitive in structure. Earnings calls follow predictable patterns. SEC filings have predictable sections. Analyst reports have predictable formats. Specialization compounds across thousands of documents.
The privacy and information-handling stakes are high. Material non-public information (MNPI) constraints govern how research-and-trading firms handle their internal analysis. Sending internal analyst drafts and target lists to a third-party cloud LLM creates information-handling risk that most compliance teams take seriously. For investment banking, the constraints are even tighter — every deal team has Chinese walls, every document touches confidentiality obligations, and "sent to a third-party AI provider" is not a phrase most general counsels want appearing in any audit trail.
The latency story matters in market-relevant workflows. When an earnings call ends, the AI summary that lands in front of the portfolio manager in two hundred milliseconds is more useful than the one that lands in three seconds. Across thousands of earnings events per quarter, the latency advantage compounds into market impact.
The volume scales with the firm's coverage and activity, which scales with the firm's growth and the market's activity. The cloud LLM bill never stops growing.
What the architecture looks like
A financial document analysis workflow on a local SLM has a structure like this.
A model is fine-tuned on financial documents — historical earnings calls, SEC filings, analyst reports, the firm's own research output. The fine-tune captures financial vocabulary, sector-specific patterns, and the firm's specific analytical style.
The model runs on infrastructure the firm controls. For asset managers, that's typically on-premises or in a dedicated private cloud with strict access controls. For investment banks, it's behind the Chinese wall infrastructure that already exists for handling deal information.
Documents flow through the inference pipeline. Earnings calls get summarized within seconds of the call ending. SEC filings get analyzed within minutes of release. Internal documents get processed without ever leaving the firm's controlled environment.
The cost flips from per-document to fixed. Volume can grow as the firm's coverage expands, without the bill moving.
The information-handling story improves. The firm can demonstrate to its compliance officers, its regulators, and its general counsel that the analysis happens inside the firm's controlled environment. This isn't a marketing point — it is, for many firms, a precondition for using AI in research workflows at all.
What gets better, beyond cost
The under-told story in financial AI is the domain-specialization quality improvement.
A general-purpose cloud LLM, summarizing an earnings call, produces a competent summary. But it doesn't know the firm's specific analytical priorities. It doesn't know which competitors matter for this name. It doesn't know the firm's house view on the sector. It doesn't know how this management team has historically over- or under-promised.
A fine-tuned local model knows. Across many cycles of research output, it has learned which questions the firm tends to ask, which signals the firm tends to weight, which framings the firm tends to use. The summaries are more useful because they are tuned to the firm's analytical lens.
For a portfolio manager processing dozens of earnings summaries per week, the quality improvement is real. The summaries are usable from the first read; the editorial pass shrinks; the time-to-decision shortens.
When the cloud LLM is still defensible
A few cases where cloud-LLM-based financial workflows are still the right answer.
For brand-new coverage areas or new fund strategies where the firm doesn't have enough historical research to fine-tune on. The cloud LLM's breadth compensates for the missing training data, for the first six to twelve months.
For one-off analytical projects where the volume doesn't justify the local infrastructure investment. The threshold is lower in finance than in some other categories because the per-document token counts are high — even a few hundred documents per month can justify the investment for a serious firm.
For workflows that explicitly require frontier-model reasoning capabilities — say, novel financial modeling that combines documents from many domains. A frontier model's breadth helps here, and the local model may not generalize well.
For everything else — the recurring, coverage-driven, MNPI-handling-relevant financial document analysis at any serious firm — the local-SLM case is strong, and the compliance case makes it close to mandatory.
The pattern, in finance
Avery NXR is not a financial analysis tool. It scaffolds Next.js applications. The architectural pattern repeats.
Financial document analysis is a narrow, repetitive, high-volume, information-handling-sensitive, latency-relevant workload. The economics that favor a specialized local model for code scaffolding are the same economics that favor a specialized local model for financial analysis. The compliance and information-handling story makes the case unusually strong in this domain.
The financial AI vendors that build excellent vertical tools on local infrastructure — with appropriate fine-tuning, deployment models that fit existing compliance frameworks, and evidence packages that satisfy auditors and regulators — will own this category. The cloud-LLM-default products will hold a portion of the market, but the institutional segment will pivot fast as soon as the tooling matures.
We expect this shift to happen relatively quickly in finance, because the compliance pressure compounds with the cost pressure, and the latency pressure compounds with both. Firms that move first will be ahead on cost, ahead on compliance, and ahead on speed-to-decision simultaneously.