Editor’s observe: This submit is a part of the Nemotron Labs weblog collection, which explores how the most recent open fashions, datasets and coaching strategies assist companies construct specialised AI methods and purposes on NVIDIA platforms. Every submit highlights sensible methods to make use of an open stack to ship worth in manufacturing — from clear analysis copilots to scalable AI brokers.
Companies as we speak face the problem of uncovering useful insights buried inside all kinds of paperwork — together with stories, shows, PDFs, internet pages and spreadsheets.
Usually, groups piece collectively insights by manually reviewing recordsdata, copying knowledge into spreadsheets, constructing dashboards and utilizing fundamental search or template-based optical character recognition (OCR) instruments that usually miss vital particulars in advanced media.
Clever doc processing is an AI-powered workflow that routinely reads, understands and extracts insights from paperwork. It interprets wealthy codecs inside these paperwork — together with tables, charts, pictures and textual content — utilizing AI brokers and strategies like retrieval-augmented era (RAG) to show the multimodal content material into insights that different multi-agent methods and folks can simply use.
With NVIDIA Nemotron open fashions and GPU-accelerated libraries, organizations can construct AI-powered doc intelligence methods for analysis, monetary companies, authorized workflows and extra.
These open fashions, datasets and coaching recipes have powered robust outcomes on leaderboards comparable to MTEB, MMTEB and ViDoRe V3, benchmarks for evaluating multilingual and multimodal retrieval fashions. Groups can select from among the many finest fashions for duties like search and query answering.
How Doc Processing Streamlines Enterprise Intelligence
Doc intelligence methods that may pull which means from advanced layouts, scale to large file libraries and present precisely the place a solution got here from are extremely helpful in high-stakes environments. These methods:
- Perceive wealthy doc content material, transferring past easy textual content scraping to seize data from charts, tables, figures and mixed-language pages and treating paperwork as a human would by recognizing construction, relationships and context.
- Deal with giant portions of shifting knowledge, ingesting and processing huge collections of paperwork in parallel, and protecting data bases repeatedly updated.
- Discover precisely what customers want, serving to AI brokers pinpoint essentially the most related passages, tables or paragraphs to a question to allow them to reply with precision and accuracy.
- Present the proof behind solutions by offering citations to particular pages or charts so groups can acquire transparency and auditability, which is important in regulated industries.

The result’s a shift from static doc archives to dwelling data methods that immediately energy enterprise intelligence, buyer experiences and operational workflows.
Doc Intelligence at Work
Clever doc processing methods constructed on NVIDIA Nemotron RAG fashions, Nemotron Parse and accelerated computing are already reshaping how organizations throughout industries acquire insights from their paperwork.
Justt: AI-Native Chargeback Administration and Dispute Optimization
In monetary companies, cost disputes create vital income loss and operational complexity for retailers, largely as a result of the proof wanted to deal with them lives in unstructured codecs. Transaction logs, buyer communications and coverage paperwork are sometimes fragmented throughout methods and tough to course of at scale, making dispute dealing with gradual, guide and expensive.
Justt.ai gives an AI-driven platform that automates the complete chargeback lifecycle at scale. The platform connects on to cost service suppliers and service provider knowledge sources to ingest transaction knowledge, buyer interactions and insurance policies, then routinely assembles dispute-specific proof that aligns with card community and issuer necessities.
The platform’s AI-powered dispute optimization, powered by Nemotron Parse, applies predictive analytics to find out which chargebacks to combat or settle for, and methods to optimize every response for max web restoration. Main hospitality operators like HEI Resorts & Resorts use the platform to automate dispute dealing with throughout their properties, recapturing income whereas sustaining visitor relationships.
By pairing document-centric intelligence with choice automation, retailers can recapture a good portion of income misplaced to illegitimate chargebacks whereas lowering guide assessment effort.
Docusign: Scaling Settlement Intelligence
Docusign is the worldwide chief in Clever Settlement Administration, dealing with tens of millions of transactions day by day for greater than 1.8 million prospects and over 1 billion customers.
Agreements are the inspiration of each enterprise, however the important data they comprise are sometimes buried inside pages of paperwork. To floor the knowledge, Docusign wanted high-fidelity extraction of tables, textual content and metadata from advanced paperwork like PDFs so organizations might perceive and act on obligations, dangers and alternatives sooner.
Docusign is evaluating Nemotron Parse for deeper contract understanding at scale. Operating on NVIDIA GPUs, the mannequin combines superior AI with structure detection and OCR. The system can reliably interpret advanced tables and reconstruct tables with required data. This reduces the necessity for guide corrections and helps make sure that even essentially the most advanced contracts are processed with the pace and accuracy their prospects anticipate.
With this basis, Docusign will rework settlement repositories into structured knowledge that powers contract search, evaluation and AI-driven workflows — turning agreements into enterprise property that assist organizations and their groups enhance visibility, scale back threat and make sooner choices.
Edison Scientific: Analysis Throughout Large Literature Scale
Edison Scientific’s Kosmos AI Scientist helps researchers navigate advanced scientific landscapes to synthesize literature, establish connections and floor proof.
Edison wanted a strategy to quickly and precisely extract structured data from giant volumes of PDFs, together with equations, tables and figures that conventional data parsing strategies usually mishandle.
By integrating the NVIDIA Nemotron Parse mannequin into its PaperQA2 pipeline, Edison can decompose analysis papers, index key ideas and floor responses in particular passages, enhancing each throughput and reply high quality for scientists. This strategy turns a sprawling analysis corpus into an interactive, queryable data engine that accelerates speculation era and literature assessment.
The excessive effectivity of Nemotron Parse allows cost-efficient serving at scale, permitting Edison’s workforce to unlock the entire multimodal pipeline.
Designing an Clever Doc Processing Software With NVIDIA Applied sciences
A strong, domain-specific doc intelligence pipeline requires applied sciences that may deal with knowledge extraction, embedding and reranking, whereas protecting the information safe and compliant with rules.
- Extraction: Nemotron extraction and OCR fashions quickly ingest multimodal PDFs, textual content, tables, graphs and pictures to transform them into structured, machine-readable content material whereas preserving structure and semantics.
- Embedding: Nemotron embedding fashions convert passages, entities and visible components into vector representations tuned for doc retrieval, enabling semantically correct search.
- Reranking: Nemotron reranking fashions consider candidate passages to make sure essentially the most related content material is surfaced as context for giant language fashions (LLMs), enhancing reply constancy and lowering hallucinations.
- Parsing: Nemotron Parse fashions decipher doc semantics to extract textual content and tables with exact spatial grounding and proper studying movement. Overcoming structure variability, they flip unstructured paperwork into actionable knowledge that enhances the accuracy of LLMs and agentic workflows.
These capabilities are packaged as NVIDIA NIM microservices and basis fashions that run effectively on NVIDIA GPUs, permitting groups to scale from proof of idea to manufacturing whereas protecting delicate knowledge inside their chosen cloud or knowledge heart atmosphere.
The best AI methods use a mixture of frontier fashions and open supply fashions like NVIDIA Nemotron, with an LLM router analyzing every job and routinely choosing the mannequin finest suited to it. This strategy retains efficiency robust whereas managing computing prices and enhancing effectivity.
Get Began With NVIDIA Nemotron
Entry a step-by-step tutorial on methods to construct a doc processing pipeline with RAG capabilities. Discover how Nemotron RAG can energy specialised brokers tailor-made for various industries.
Plus, experiment with Nemotron RAG fashions and the NVIDIA NeMo Retriever open library, accessible on GitHub and Hugging Face, in addition to Nemotron Parse on Hugging Face.
Be a part of the group of builders constructing with the NVIDIA Blueprint for Enterprise RAG — trusted by a dozen industry-leading AI Knowledge Platform suppliers and accessible now on construct.nvidia.com, GitHub and the NGC catalog.
Keep updated on agentic AI, NVIDIA Nemotron and extra by subscribing to NVIDIA AI information, becoming a member of the group and following NVIDIA AI on LinkedIn, Instagram, X and Fb.

