Text Analytics of PDF Technical Documents

Customer Challenge

The Air Force required a logistics data crosswalk to mitigate known maintenance and supply data connection challenges limiting accurate demand planning and forecasting.

Innovative Solution

ILW data scientists used natural language processing (NLP) and unsupervised machine learning (ML) techniques to evaluate and determine an automated method to tie Work Unit Code (WUC) to related National Item Identification Numbers (NIIN). They used information extracted from Technical Orders in native PDF format as well as data captured in maintenance and supply data systems.

Benefits/Outcomes

  • Extracted master parts list (MPL) for two Air Force weapon system programs
  • Developed multiple table extraction techniques that read PDF documents and pull tabular information out with high degrees of accuracy. Techniques leverage and improve open-source libraries
  • Provide enterprise search capability of Air Force technical documents

Business Value

  • Improves parts supportability, contract lead times, integrated repair planning
  • Enables planning for predictable shifts in demands and condemnations, buying the right quantities of the right parts, avoiding overbuy on other parts

Toolbox

  • Open-source Python solution using DoD-compatible libraries: Pandas, Tabula and Fitz, Scikit-learn, and OpenCV
  • Native PDFs
  • Text Analytics, NLP, Machine Learning, Computer Vision

Related Case Studies You May Like

Operationalizing Digital Depot Transformation at Enterprise Scale (Air Force)

Operationalizing Digital Depot Transformation at Enterprise Scale (Air Force)

Unlocking Engineering Data to Accelerate Sustainment & Manufacturing Decisions (Air Force)

Unlocking Engineering Data to Accelerate Sustainment & Manufacturing Decisions (Air Force)

From Machine Signals to Maintenance Decisions: Enabling Predictive Sustainment at Scale (Air Force)

From Machine Signals to Maintenance Decisions: Enabling Predictive Sustainment at Scale (Air Force)

Turning Cybersecurity Compliance into AI-Driven Audit Readiness & Decision Support (Air Force)

Turning Cybersecurity Compliance into AI-Driven Audit Readiness & Decision Support (Air Force)

Agentic AI & RAG for Cybersecurity (Air Force)

Agentic AI & RAG for Cybersecurity (Air Force)

Agentic AI Natural Language Reasoning (Air Force)

Agentic AI Natural Language Reasoning (Air Force)

Accelerating Contract Negotiations Through Automated Price Analysis & Documentation (Air Force)

Accelerating Contract Negotiations Through Automated Price Analysis & Documentation (Air Force)

Forecasting Aircraft Availability & Maintenance Demand to Improve Readiness (Air Force)

Forecasting Aircraft Availability & Maintenance Demand to Improve Readiness (Air Force)

Enabling What-If Scenario Planning for Predictive Logistics Decisions (Air Force)

Enabling What-If Scenario Planning for Predictive Logistics Decisions (Air Force)

ML/AI Object Tracking Model (Army)

ML/AI Object Tracking Model (Army)

Enabling Predictive Maintenance for Mission-Critical Missile Systems (Navy)

Enabling Predictive Maintenance for Mission-Critical Missile Systems (Navy)

From Parts Data to Print Decisions: Scaling Additive Manufacturing in the OIB (Army)

From Parts Data to Print Decisions: Scaling Additive Manufacturing in the OIB (Army)

Automating Data Rights Verification to Reduce Program Risk & Accelerate Acquisition Decisions (Air Force)

Automating Data Rights Verification to Reduce Program Risk & Accelerate Acquisition Decisions (Air Force)

Delivering Resilient, Real-Time Analytics at the Tactical Edge (Navy)

Delivering Resilient, Real-Time Analytics at the Tactical Edge (Navy)

Data Science & Architecture Assessment (Marketing)

Data Science & Architecture Assessment (Marketing)

Text Analytics of PDF Technical Documents (Air Force)

Text Analytics of PDF Technical Documents (Air Force)

Turning Raw Behavioral Data into Predictive Customer Insight with Deep Learning (Retail)

Turning Raw Behavioral Data into Predictive Customer Insight with Deep Learning (Retail)

Automated Data Cleansing with Machine Learning (Navy)

Automated Data Cleansing with Machine Learning (Navy)

Automated Data Capture and Prediction (Air Force)

Automated Data Capture and Prediction (Air Force)

Automated Data Crosswalks (Air Force SBIR)

Automated Data Crosswalks (Air Force SBIR)

Transforming Contracts into Enterprise Insight to Accelerate Acquisition Decisions (Air Force)

Transforming Contracts into Enterprise Insight to Accelerate Acquisition Decisions (Air Force)

Decision Support for Cyber Hygiene (Air Force)

Decision Support for Cyber Hygiene (Air Force)

Turning Complex Financial Rules into Automated, Scalable Decision Making (Insurance)

Turning Complex Financial Rules into Automated, Scalable Decision Making (Insurance)

Delivering On-Demand Sustainment Insights for Faster, Data-Driven Decisions (Air Force)

Delivering On-Demand Sustainment Insights for Faster, Data-Driven Decisions (Air Force)

Predicting Contract Performance Risks to Enable Proactive Acquisition Management (Air Force)

Predicting Contract Performance Risks to Enable Proactive Acquisition Management (Air Force)

Machine Learning & NLP for Decision Support (Healthcare)

Machine Learning & NLP for Decision Support (Healthcare)

Turning Raw Data into Scalable, Analytics-Ready Pipelines for Data Science & AI (Energy)

Turning Raw Data into Scalable, Analytics-Ready Pipelines for Data Science & AI (Energy)

Interested In Working With Us?