
The Real AI Bottleneck Isn’t Compute—It’s Trustworthy Data

In 2025, the gold rush isn’t in models—it’s in data. Across boardrooms, the conversation is no longer about whether to adopt artificial intelligence but how fast and how broadly it can be scaled. Enterprises are deploying AI to reinvent customer experience, automate operations, and forecast demand with near-psychic precision. IDC estimates global spending on AI will top $500 billion this year. Generative AI alone is being hailed as the next industrial revolution.

Yet amidst the euphoria, a growing number of AI initiatives are quietly underperforming—or outright failing. The reason? It's not a lack of compute. Not a shortage of talent. Not even model complexity. The problem is far more fundamental and far more damaging.

It’s bad data.

 

A World Racing Toward AI—Blind to Its Foundations

Today’s AI strategies are built on sand. Over 80% of enterprise data remains unstructured, unclassified, and often unreliable. According to Gartner, by 2026, 75% of AI projects will fail due to issues stemming from data quality, governance, and model trustworthiness. While companies obsess over fine-tuning models and optimizing inference speeds, most forget the raw fuel AI runs on: data that is complete, clean, contextualized, and accessible.

Unfortunately, that’s rarely the case.

Take the example of a leading global bank that invested millions into an AI model to detect insider trading signals. The model performed well in testing, but in production, it flagged thousands of false positives. The cause? Inconsistent timestamp formats across business units led to skewed event timelines—something never caught because the data was never properly profiled or standardized.
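Basic data profiling would have surfaced the problem before production. A minimal sketch of the kind of timestamp standardization involved (the format list and helper function are illustrative, not the bank's actual pipeline):

```python
from datetime import datetime, timezone

# Illustrative mix of formats as they might arrive from different business units.
KNOWN_FORMATS = ["%Y-%m-%dT%H:%M:%S%z", "%d/%m/%Y %H:%M", "%m-%d-%Y %H:%M:%S"]

def normalize_timestamp(raw: str) -> str:
    """Try each known format and return a canonical UTC ISO-8601 string."""
    for fmt in KNOWN_FORMATS:
        try:
            dt = datetime.strptime(raw, fmt)
        except ValueError:
            continue
        if dt.tzinfo is None:          # assumption: naive values are already UTC
            dt = dt.replace(tzinfo=timezone.utc)
        return dt.astimezone(timezone.utc).isoformat()
    raise ValueError(f"Unrecognized timestamp format: {raw!r}")

# Three representations of the same moment, from three hypothetical systems:
events = ["2025-03-01T09:30:00+05:30", "01/03/2025 04:00", "03-01-2025 04:00:00"]
print([normalize_timestamp(e) for e in events])
# All three normalize to "2025-03-01T04:00:00+00:00"
```

Without a step like this, the same trade can appear to happen at three different times, and any model reasoning about event sequences will learn from a distorted timeline.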

Or look at healthcare, where clinical AI is now assisting in diagnostic decisions. A recent MIT study revealed that 20% of training datasets used to build AI models for disease prediction were duplicated, mislabeled, or missing critical demographic tags. That doesn’t just introduce bias—it could cost lives.
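Duplicates and missing demographic tags are exactly the defects a simple profiling pass catches. A hedged sketch (record layout and field names are hypothetical, not from the MIT study):

```python
from collections import Counter

# Hypothetical patient-level training records; fields are illustrative.
records = [
    {"id": "p1", "age": 54, "sex": "F", "label": "positive"},
    {"id": "p2", "age": None, "sex": "M", "label": "negative"},  # missing age
    {"id": "p1", "age": 54, "sex": "F", "label": "positive"},    # exact duplicate
    {"id": "p3", "age": 61, "sex": None, "label": "negative"},   # missing sex
]

def profile(rows, required=("age", "sex")):
    """Count exact duplicate rows and rows missing required demographic tags."""
    seen = Counter(tuple(sorted(r.items())) for r in rows)
    dupes = sum(n - 1 for n in seen.values() if n > 1)
    missing = sum(1 for r in rows if any(r.get(k) is None for k in required))
    return {"rows": len(rows), "duplicates": dupes, "missing_demographics": missing}

print(profile(records))
# {'rows': 4, 'duplicates': 1, 'missing_demographics': 2}
```

Run before training, a report like this turns a silent bias risk into a visible, fixable defect count.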

These are not isolated incidents. They reflect a broader truth: when poor-quality data feeds an intelligent system, it doesn’t matter how sophisticated your model is. The result is not insight—it’s noise, risk, and reputational damage.

 

The Cost of Ignoring Data Quality

Let’s be clear—data is no longer a back-office concern. It is now a strategic asset. And just like any critical asset, when mismanaged, it becomes a liability.

The economic toll of poor data is staggering. IBM estimates the global cost of bad data at over $3.1 trillion annually. At an enterprise level, Gartner reports that companies lose an average of $12.9 million every year due to poor data quality, from wasted marketing spend to flawed forecasting to regulatory penalties. In sectors like finance and pharmaceuticals, the cost is not just monetary—it’s about loss of trust, failed audits, and non-compliance with stringent frameworks like the EU AI Act, HIPAA, or India’s DPDP Act.

AI only amplifies these risks. Unlike traditional software, AI learns from what it’s fed. Feed it biased data, and it will perpetuate discrimination. Feed it outdated data, and it will make decisions based on yesterday’s world. Feed it fragmented data, and it will hallucinate patterns that don’t exist. This is the dark side of AI—one that remains hidden until the damage is done.

 

A Strategic Shift—Data Quality as a Core Product Discipline

To escape this cycle, a fundamental mindset shift is required. Data quality must not be treated as a compliance checkbox or post-processing fix. It must be managed like a product—with versioning, feedback loops, clear ownership, performance metrics, and user-centric design.

This approach borrows from the discipline of product management and applies it to the enterprise data stack. Instead of passively consuming data, organizations need to actively build and maintain it, like they would a customer-facing application.

Here’s how this strategy plays out:

  • Enterprises must define what “good data” means in their context. This involves establishing quality KPIs—such as completeness, consistency, timeliness, lineage, and usability. These metrics must be aligned not just with IT standards but with business goals. 
  • They must embed quality assurance into every stage of the data lifecycle. This means deploying schema validation, anomaly detection, deduplication, and enrichment directly into ingestion and processing layers.
  • AI must be used to fix AI’s fuel. Machine learning-based data remediation tools can now identify and auto-correct anomalies, missing values, and mismatches at scale. Generative techniques like data synthesis and imputation are also evolving to support downstream model reliability without overfitting.
  • Governance must be federated—but coordinated. Centralized data teams often struggle with context. Instead, federated governance—where data ownership is pushed to domain experts but aligned via common standards and policy orchestration—ensures quality is both local and consistent. Metadata catalogs, lineage graphs, and data contracts between producers and consumers are essential in enforcing this model.
  • Organizations must operationalize quality metrics into dashboards visible to C-level leadership. Just as product teams report on NPS and adoption, data teams must report on uptime, error rates, trust scores, and business impact, turning invisible data problems into tangible business conversations.
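The KPI and validation steps above can be sketched together: schema rules are applied at ingestion, and the pass rates roll up into the completeness and validity scores a leadership dashboard would display. The schema, thresholds, and field names here are illustrative assumptions:

```python
# Minimal sketch: turn per-field checks into dashboard-ready quality KPIs.
def quality_kpis(rows, schema):
    """schema maps field -> validator callable; returns per-KPI ratios."""
    total = len(rows) * len(schema)
    complete = sum(1 for r in rows for f in schema if r.get(f) is not None)
    valid = sum(1 for r in rows for f, check in schema.items()
                if r.get(f) is not None and check(r[f]))
    return {
        "completeness": round(complete / total, 3),  # share of fields populated
        "validity": round(valid / total, 3),         # share passing their rule
    }

# Hypothetical transaction feed with two schema rules:
schema = {
    "amount": lambda v: isinstance(v, (int, float)) and v >= 0,
    "currency": lambda v: v in {"USD", "EUR", "INR"},
}
rows = [
    {"amount": 120.0, "currency": "USD"},
    {"amount": -5, "currency": "USD"},    # fails the validity rule
    {"amount": 99.9, "currency": None},   # fails completeness
]
print(quality_kpis(rows, schema))
# {'completeness': 0.833, 'validity': 0.667}
```

In practice these ratios would be computed continuously per dataset and trended over time, the data-team equivalent of uptime and error-rate reporting.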
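On the remediation point, production tools use learned models to correct anomalies; the simplest member of that family, mean imputation of missing numeric values, can stand in as a sketch (function and data are illustrative, not a specific vendor's API):

```python
# Sketch of automated remediation: fill missing numeric values with the
# column mean so downstream models receive complete records.
def impute_mean(rows, field):
    observed = [r[field] for r in rows if r[field] is not None]
    mean = sum(observed) / len(observed)
    return [dict(r, **{field: r[field] if r[field] is not None else mean})
            for r in rows]

rows = [{"age": 40}, {"age": None}, {"age": 60}]
print(impute_mean(rows, "age"))
# [{'age': 40}, {'age': 50.0}, {'age': 60}]
```

Real remediation pipelines go further, using model-based or generative imputation, but the contract is the same: gaps are filled transparently and the correction is logged, not hidden.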



© Copyright nasscom. All Rights Reserved.