Document AI

Every document, structured. Locally.

Local NVIDIA OCR pipeline turning documents into structured data

The client's documents were leaking through a managed parser, and the structured outputs were rarely clean. We deployed Nemotron OCR and NemoRetriever locally on their GPUs and shipped clean structured data with zero egress.

Data egress: zero

Marginal cost: near zero

Throughput: +18%

The challenge

The team was running every document through a managed SaaS parser. Tables came back broken. Signatures and form fields needed manual cleanup on almost every page. Compliance flagged the data leaving the building, every single time. Marginal cost per page kept rising as volume grew, and the SaaS roadmap did not match what the team actually needed.

What we did

Built a local pipeline on the client's GPU cluster. Nemotron OCR v1 for high-accuracy text extraction. NemoRetriever OCR for layout and table structure. Nemotron Nano 12B VL on top for visual reasoning over the harder pages, signatures, and forms. The pipeline emits structured JSON against the schemas the team already used downstream, so nothing in the case management system had to change. All NVIDIA. All on their hardware. No data leaves the cluster.
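
The stage layout described above can be sketched roughly as follows. This is a minimal illustration, not the client's actual code: the stage callables, the `PageResult` shape, the routing rule in `needs_visual_pass`, and the `SCHEMA` contract are all hypothetical stand-ins, and none of them are real NVIDIA APIs. Stub stages replace the locally hosted models so the sketch runs without a GPU.

```python
import json
from dataclasses import dataclass, field, asdict
from typing import Callable

# Hypothetical names throughout: the stage callables stand in for the locally
# hosted OCR, layout, and vision-language models. They are not real NVIDIA APIs.
TextStage = Callable[[bytes], str]
LayoutStage = Callable[[bytes], dict]
VisualStage = Callable[[bytes, dict], dict]


@dataclass
class PageResult:
    page: int
    text: str
    tables: list = field(default_factory=list)
    fields: dict = field(default_factory=dict)


def needs_visual_pass(layout: dict) -> bool:
    """Assumed routing rule: only pages with signatures or form fields are 'hard'."""
    return bool(layout.get("signatures") or layout.get("form_fields"))


def process_page(page_no: int, image: bytes, extract_text: TextStage,
                 extract_layout: LayoutStage, reason_visually: VisualStage) -> PageResult:
    """Run one page through text extraction, layout analysis, and (if needed) visual reasoning."""
    text = extract_text(image)
    layout = extract_layout(image)
    result = PageResult(page=page_no, text=text, tables=layout.get("tables", []))
    # Only the harder pages pay for the vision-language pass.
    if needs_visual_pass(layout):
        result.fields = reason_visually(image, layout)
    return result


# Assumed downstream contract: key name -> required Python type.
SCHEMA = {"page": int, "text": str, "tables": list, "fields": dict}


def to_record(result: PageResult) -> dict:
    """Convert a PageResult to a plain dict and reject it if it breaks the contract."""
    record = asdict(result)
    for key, expected in SCHEMA.items():
        if not isinstance(record.get(key), expected):
            raise ValueError(f"field {key!r} is not a {expected.__name__}")
    return record


# Stub stages so the sketch runs without any model or GPU.
def stub_text(image: bytes) -> str:
    return "Invoice 0001"


def stub_layout(image: bytes) -> dict:
    return {"tables": [[["item", "qty"], ["widget", "2"]]],
            "signatures": [{"bbox": [0, 0, 10, 10]}]}


def stub_visual(image: bytes, layout: dict) -> dict:
    return {"signed": True}


record = to_record(process_page(1, b"", stub_text, stub_layout, stub_visual))
print(json.dumps(record, indent=2))
```

Checking each record against a fixed contract before it leaves the pipeline is one way to keep the downstream system unchanged: a page that fails validation is rejected at the boundary instead of reaching case management in a broken shape.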

The outcome

Tables came out right. Signatures and form fields stopped needing human cleanup at the rate they used to. Marginal cost per page dropped to the cost of electricity. Compliance signed off because the data never leaves the building. Throughput surpassed that of the previous SaaS parser within two weeks of going live.
