Automated Data Cleansing with Machine Learning

Customer Challenge

Poor data quality is hindering the Department of Navy’s (DON) ability to gain valuable and accurate insight from their data. Given the volume of errors, manual correction is ineffective and inefficient.

Innovative Solution

ILW data scientists implemented Phase I of our ADCATâ„¢ solution for automated data cleansing and analysis, which applies machine learning (ML) and probabilistic graphical modeling (PGM) to automatically cleanse DON data of errors. For Phase II, ILW applied algorithm enhancements, optimization, model quality monitoring, and user interface creation for improved healing functionality across domains as well as deployed ADCAT to a DON production environment.

Benefits/Outcomes

  • Robust natural language processing (NLP) and ML classifier models, achieve 96 – 99.8% accuracy
  • ADCAT’s PGMs provide end-users with the five most probable corrections for a given error; 98% of the time the correct value was in the top five most probable values
  • Exposes black box of ML error correction logic by providing transparent, human-understandable explanations
  • Scalable processes and automatic discovery methods enable new error correction models to be built quickly
  • Human-in-the-loop solution is available to enable review and validation of the ML-driven error corrections

Business Value

  • Improved analyst productivity: less time correcting data, increased focus on core mission tasks
  • Higher quality data: higher-confidence, data-informed decisions, cost savings

Toolbox

  • Supervised/unsupervised ML
  • Probabilistic graphical models (Bayesian Networks)
  • Natural language processing
  • Open-source Python solution using DoD-compatible libraries

Domain Expertise

  • NAVAIR maintenance data
  • NAVSEA labor data

Related Case Studies You May Like

Operationalizing Digital Depot Transformation at Enterprise Scale (Air Force)

Operationalizing Digital Depot Transformation at Enterprise Scale (Air Force)

Unlocking Engineering Data to Accelerate Sustainment & Manufacturing Decisions (Air Force)

Unlocking Engineering Data to Accelerate Sustainment & Manufacturing Decisions (Air Force)

From Machine Signals to Maintenance Decisions: Enabling Predictive Sustainment at Scale (Air Force)

From Machine Signals to Maintenance Decisions: Enabling Predictive Sustainment at Scale (Air Force)

Turning Cybersecurity Compliance into AI-Driven Audit Readiness & Decision Support (Air Force)

Turning Cybersecurity Compliance into AI-Driven Audit Readiness & Decision Support (Air Force)

Agentic AI & RAG for Cybersecurity (Air Force)

Agentic AI & RAG for Cybersecurity (Air Force)

Agentic AI Natural Language Reasoning (Air Force)

Agentic AI Natural Language Reasoning (Air Force)

Accelerating Contract Negotiations Through Automated Price Analysis & Documentation (Air Force)

Accelerating Contract Negotiations Through Automated Price Analysis & Documentation (Air Force)

Forecasting Aircraft Availability & Maintenance Demand to Improve Readiness (Air Force)

Forecasting Aircraft Availability & Maintenance Demand to Improve Readiness (Air Force)

Enabling What-If Scenario Planning for Predictive Logistics Decisions (Air Force)

Enabling What-If Scenario Planning for Predictive Logistics Decisions (Air Force)

Turning Financial Data into Data-Driven Budget Planning & Forecasting (Air Force)

Turning Financial Data into Data-Driven Budget Planning & Forecasting (Air Force)

From Machine Data to Mission Decisions: Enterprise Data Activation for Sustainment (Air Force)

From Machine Data to Mission Decisions: Enterprise Data Activation for Sustainment (Air Force)

Turning Location Data into Actionable Market & Customer Insight (eCommerce/Retail)

Turning Location Data into Actionable Market & Customer Insight (eCommerce/Retail)

ML/AI Object Tracking Model (Army)

ML/AI Object Tracking Model (Army)

Enabling Predictive Maintenance for Mission-Critical Missile Systems (Navy)

Enabling Predictive Maintenance for Mission-Critical Missile Systems (Navy)

From Parts Data to Print Decisions: Scaling Additive Manufacturing in the OIB (Army)

From Parts Data to Print Decisions: Scaling Additive Manufacturing in the OIB (Army)

Automating Data Rights Verification to Reduce Program Risk & Accelerate Acquisition Decisions (Air Force)

Automating Data Rights Verification to Reduce Program Risk & Accelerate Acquisition Decisions (Air Force)

Delivering Resilient, Real-Time Analytics at the Tactical Edge (Navy)

Delivering Resilient, Real-Time Analytics at the Tactical Edge (Navy)

Statistical Model & Training Algorithms (Air Force)

Statistical Model & Training Algorithms (Air Force)

Data Science & Architecture Assessment (Marketing)

Data Science & Architecture Assessment (Marketing)

Text Analytics of PDF Technical Documents (Air Force)

Text Analytics of PDF Technical Documents (Air Force)

Turning Raw Behavioral Data into Predictive Customer Insight with Deep Learning (Retail)

Turning Raw Behavioral Data into Predictive Customer Insight with Deep Learning (Retail)

Automated Data Cleansing with Machine Learning (Navy)

Automated Data Cleansing with Machine Learning (Navy)

Automated Data Capture and Prediction (Air Force)

Automated Data Capture and Prediction (Air Force)

Automated Data Crosswalks (Air Force SBIR)

Automated Data Crosswalks (Air Force SBIR)

Transforming Contracts into Enterprise Insight to Accelerate Acquisition Decisions (Air Force)

Transforming Contracts into Enterprise Insight to Accelerate Acquisition Decisions (Air Force)

Decision Support for Cyber Hygiene (Air Force)

Decision Support for Cyber Hygiene (Air Force)

Identifying & Prioritizing Supply Chain Risks to Improve Readiness (Air Force)

Identifying & Prioritizing Supply Chain Risks to Improve Readiness (Air Force)

Turning Complex Financial Rules into Automated, Scalable Decision Making (Insurance)

Turning Complex Financial Rules into Automated, Scalable Decision Making (Insurance)

Delivering On-Demand Sustainment Insights for Faster, Data-Driven Decisions (Air Force)

Delivering On-Demand Sustainment Insights for Faster, Data-Driven Decisions (Air Force)

Predicting Contract Performance Risks to Enable Proactive Acquisition Management (Air Force)

Predicting Contract Performance Risks to Enable Proactive Acquisition Management (Air Force)

Machine Learning & NLP for Decision Support (Healthcare)

Machine Learning & NLP for Decision Support (Healthcare)

Turning Raw Data into Scalable, Analytics-Ready Pipelines for Data Science & AI (Energy)

Turning Raw Data into Scalable, Analytics-Ready Pipelines for Data Science & AI (Energy)

Engines Forecast Reporting Tool (Air Force)

Engines Forecast Reporting Tool (Air Force)

Interested In Working With Us?