Document AI

Every document, structured. Locally.

Local NVIDIA OCR pipeline turning documents into structured data

Their documents were leaking through a managed parser and the structured outputs were rarely clean. We deployed Nemotron OCR and NemoRetriever locally on their GPUs and shipped clean structured data with zero egress.

Data egress: 0
Marginal cost: near zero
Throughput: +18%

The team was running every document through a managed SaaS parser. Tables came back broken. Signatures and form fields needed manual cleanup on almost every page. Compliance flagged the data leaving the building, every single time. Marginal cost per page kept rising as volume grew, and the SaaS roadmap did not match what the team actually needed.

We built a local pipeline on the client's GPU cluster. Nemotron OCR v1 for high-accuracy text extraction. NemoRetriever OCR for layout and table structure. Nemotron Nano 12B VL on top for visual reasoning over the harder pages, signatures, and forms. The pipeline emits structured JSON against the schemas the team already used downstream, so nothing in the case management system had to change. All NVIDIA. All on their hardware. No data leaves the cluster.
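As a rough illustration of how the three stages fit together, here is a minimal Python sketch. It is not the client's code. The stage callables (`ocr`, `layout`, `vl`), the `has_signature` / `has_form` flags, and the three-key output schema are all hypothetical stand-ins for locally hosted model calls and for the team's real downstream schemas.

```python
import json
from dataclasses import dataclass, field
from typing import Callable

# Hypothetical stage signature: takes raw page bytes, returns a dict.
# These are NOT real NVIDIA APIs; they stand in for local model endpoints.
Stage = Callable[[bytes], dict]


@dataclass
class PageResult:
    text: str = ""
    tables: list = field(default_factory=list)
    fields: dict = field(default_factory=dict)


# Assumed downstream schema: required keys and their expected Python types.
REQUIRED = {"text": str, "tables": list, "fields": dict}


def process_page(page: bytes, ocr: Stage, layout: Stage, vl: Stage) -> PageResult:
    """Run text OCR and layout on every page; escalate hard pages to the VL model."""
    result = PageResult()
    result.text = ocr(page).get("text", "")
    structure = layout(page)
    result.tables = structure.get("tables", [])
    # Assumed routing rule: only call the heavier vision-language model
    # when the layout stage flags a signature or a form on the page.
    if structure.get("has_signature") or structure.get("has_form"):
        result.fields = vl(page).get("fields", {})
    return result


def to_json(result: PageResult) -> str:
    """Check the record against the assumed schema, then serialize it."""
    record = {"text": result.text, "tables": result.tables, "fields": result.fields}
    for key, expected in REQUIRED.items():
        if not isinstance(record[key], expected):
            raise TypeError(f"{key} must be {expected.__name__}")
    return json.dumps(record, sort_keys=True)
```

Passing the stages in as plain callables keeps the routing logic testable without a GPU, and checking the record before serialization is one way to hold the output to a schema the downstream system already expects.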

Tables came out right. Signatures and form fields stopped needing human cleanup at the rate they used to. Marginal cost per page dropped to roughly the cost of electricity. Compliance signed off because the data never leaves the building. Throughput passed the previous SaaS parser's within two weeks of going live, for an 18% gain.