In the AI science boom, beware: your results are only as good as your data – Nature.com
Hunter Moseley says that good reproducibility practices are essential to fully harness the potential of big data.Credit: Hunter N.B. Moseley
We are in the middle of a data-driven science boom. Huge, complex data sets, often with large numbers of individually measured and annotated features, are fodder for voracious artificial intelligence (AI) and machine-learning systems, with details of new applications being published almost daily.
But publication in itself is not synonymous with factuality. Just because a paper, method or data set is published does not mean that it is correct and free from mistakes. Without checking for accuracy and validity before using these resources, scientists will surely encounter errors. In fact, they already have.
In the past few months, members of our bioinformatics and systems-biology laboratory have reviewed state-of-the-art machine-learning methods for predicting the metabolic pathways that metabolites belong to, on the basis of the molecules chemical structures1. We wanted to find, implement and potentially improve the best methods for identifying how metabolic pathways are perturbed under different conditions: for instance, in diseased versus normal tissues.
We found several papers, published between 2011 and 2022, that demonstrated the application of different machine-learning methods to a gold-standard metabolite data set derived from the Kyoto Encyclopedia of Genes and Genomes (KEGG), which is maintained at Kyoto University in Japan. We expected the algorithms to improve over time, and saw just that: newer methods performed better than older ones did. But were those improvements real?
Scientific reproducibility enables careful vetting of data and results by peer reviewers as well as by other research groups, especially when the data set is used in new applications. Fortunately, in keeping with best practices for computational reproducibility, two of the papers2,3 in our analysis included everything that is needed to put their observations to the test: the data set they used, the computer code they wrote to implement their methods and the results generated from that code. Three of the papers24 used the same data set, which allowed us to make direct comparisons. When we did so, we found something unexpected.
It is common practice in machine learning to split a data set in two and to use one subset to train a model and another to evaluate its performance. If there is no overlap between the training and testing subsets, performance in the testing phase will reflect how well the model learns and performs. But in the papers we analysed, we identified a catastrophic data leakage problem: the two subsets were cross-contaminated, muddying the ideal separation. More than 1,700 of 6,648 entries from the KEGG COMPOUND database about one-quarter of the total data set were represented more than once, corrupting the cross-validation steps.
NatureTech
When we removed the duplicates in the data set and applied the published methods again, the observed performance was less impressive than it had first seemed. There was a substantial drop in the F1 score a machine-learning evaluation metric that is similar to accuracy but is calculated in terms of precision and recall from 0.94 to 0.82. A score of 0.94 is reasonably high and indicates that the algorithm is usable in many scientific applications. A score of 0.82, however, suggests that it can be useful, but only for certain applications and only if handled appropriately.
It is, of course, unfortunate that these studies were published with flawed results stemming from the corrupted data set; our work calls their findings into question. But because the authors of two of the studies followed best practices in computational scientific reproducibility and made their data, code and results fully available, the scientific method worked as intended, and the flawed results were detected and (to the best of our knowledge) are being corrected.
The third team, as far as we can tell, included neither their data set nor their code, making it impossible for us to properly evaluate their results. If all of the groups had neglected to make their data and code available, this data-leakage problem would have been almost impossible to catch. That would be a problem not just for the studies that were already published, but also for every other scientist who might want to use that data set for their own work.
More insidiously, the erroneously high performance reported in these papers could dissuade others from attempting to improve on the published methods, because they would incorrectly find their own algorithms lacking by comparison. Equally troubling, it could also complicate journal publication, because demonstrating improvement is often a requirement for successful review potentially holding back research for years.
So, what should we do with these erroneous studies? Some would argue that they should be retracted. We would caution against such a knee-jerk reaction at least as a blanket policy. Because two of the three papers in our analysis included the data, code and full results, we could evaluate their findings and flag the problematic data set. On one hand, that behaviour should be encouraged for instance, by allowing the authors to publish corrections. On the other, retracting studies with both highly flawed results and little or no support for reproducible research would send the message that scientific reproducibility is not optional. Furthermore, demonstrating support for full scientific reproducibility provides a clear litmus test for journals to use when deciding between correction and retraction.
Now, scientific data are growing more complex every day. Data sets used in complex analyses, especially those involving AI, are part of the scientific record. They should be made available along with the code with which to analyse them either as supplemental material or through open data repositories, such as Figshare (Figshare has partnered with Springer Nature, which publishes Nature, to facilitate data sharing in published manuscripts) and Zenodo, that can ensure data persistence and provenance. But those steps will help only if researchers also learn to treat published data with some scepticism, if only to avoid repeating others mistakes.
See the original post here:
In the AI science boom, beware: your results are only as good as your data - Nature.com
- AI is both a new threat and a new solution at the UN climate conference - Business Insider - November 24th, 2024 [November 24th, 2024]
- The Only Free AI Tools You Need for Peak Productivity - How-To Geek - November 24th, 2024 [November 24th, 2024]
- Marc Benioff thinks we've reached the 'upper limits' of LLMs the future, he says, is AI agents - Business Insider - November 24th, 2024 [November 24th, 2024]
- Taiwan Semi (TSM) Positioned for Growth Amid NVIDIAs AI Demand Surge, Says BofA Analyst - Yahoo Finance - November 24th, 2024 [November 24th, 2024]
- AI Models Secretly Learn Capabilities Long Before They Show Them, Researchers Find - Decrypt - November 24th, 2024 [November 24th, 2024]
- Generative AI Revenue on Track to 10X by 2030: 1 AI Stock That Will Benefit (Hint: It's Not Nvidia) - The Motley Fool - November 24th, 2024 [November 24th, 2024]
- Alien Civilizations May Have Already Formed a New Kind of AI-Based Consciousness, Scientists Say - Popular Mechanics - November 24th, 2024 [November 24th, 2024]
- Most Gen Zers are terrified of AI taking their jobs. Their bosses consider themselves immune - Fortune - November 24th, 2024 [November 24th, 2024]
- AI voice scams are on the rise heres how to stay safe, according to security experts - TechRadar - November 24th, 2024 [November 24th, 2024]
- Why you're wrong about AI art, according to the Ai-Da robot that just made a $1 million painting - TechRadar - November 24th, 2024 [November 24th, 2024]
- The curious case of Nebius, the publicly traded AI infrastructure startup - TechCrunch - November 24th, 2024 [November 24th, 2024]
- A new culture war Is brewing and Coca-Cola's AI Christmas ad is at the center - Salon - November 24th, 2024 [November 24th, 2024]
- A new generation of shopping cart, with GPS and AI - CBS News - November 24th, 2024 [November 24th, 2024]
- AI bots could be a new tool to get people to be open about their feelings - Fast Company - November 24th, 2024 [November 24th, 2024]
- Do You Believe That AI Will Ruin Photography? Do You See It Already Happening? - Fstoppers - November 24th, 2024 [November 24th, 2024]
- Weekend Round-Up: AI Dominates Headlines With Nvidia, Elon Musk, And Hollywood's Big Names - Benzinga - November 24th, 2024 [November 24th, 2024]
- Conservationists turn to AI in battle to save red squirrels - BBC.com - November 24th, 2024 [November 24th, 2024]
- The Many Ways WSJ Readers Use AI in Their Everyday Lives - The Wall Street Journal - November 24th, 2024 [November 24th, 2024]
- Jensen says solving AI hallucination problems is 'several years away,' requires increasing computation - Tom's Hardware - November 24th, 2024 [November 24th, 2024]
- 4 AI Data-Center Stocks to Buy for the Big Trend. Demand Is Robust. - Barron's - November 24th, 2024 [November 24th, 2024]
- Nvidia Sees Continued AI Momentum. Is This a Golden Opportunity to Buy the Stock? - The Motley Fool - November 24th, 2024 [November 24th, 2024]
- Stanford Professor Allegedly Includes Fake AI Citations in Filing on Deepfake Bill - PCMag - November 24th, 2024 [November 24th, 2024]
- Ex-Google CEO Eric Schmidt says AI will 'shape' identity and that 'normal people' are not ready for it - Business Insider - November 24th, 2024 [November 24th, 2024]
- AI can be used to create job promotion, not be a job replacement, says AWS vice president - Business Insider - November 24th, 2024 [November 24th, 2024]
- Nvidia Has $71 Million Invested in These Smaller-Cap AI Stocks - Yahoo Finance - November 24th, 2024 [November 24th, 2024]
- Advancing urban tree monitoring with AI-powered digital twins - MIT News - November 24th, 2024 [November 24th, 2024]
- A Pennsylvania boy used AI to make nude images of female students. Was it illegal? - USA TODAY - November 24th, 2024 [November 24th, 2024]
- Wakeup Call for HR: Employees Trust AI More Than They Trust You - Josh Bersin - November 24th, 2024 [November 24th, 2024]
- The AI Reporter That Took My Old Job Just Got Fired - WIRED - November 24th, 2024 [November 24th, 2024]
- US ahead in AI innovation, easily surpassing China in Stanfords new ranking - The Associated Press - November 21st, 2024 [November 21st, 2024]
- Announcing recipients of the Google.org AI Opportunity Fund: Europe - The Keyword - November 21st, 2024 [November 21st, 2024]
- AI agents what they are, and how theyll change the way we work - Microsoft - November 21st, 2024 [November 21st, 2024]
- Shannon Vallor says AI does present an existential risk but not the one you think - Vox.com - November 21st, 2024 [November 21st, 2024]
- US gathers allies to talk AI safety as Trump's vow to undo Biden's AI policy overshadows their work - The Associated Press - November 21st, 2024 [November 21st, 2024]
- Google's AI-Powered OSS-Fuzz Tool Finds 26 Vulnerabilities in Open-Source Projects - The Hacker News - November 21st, 2024 [November 21st, 2024]
- The intersection of AI and the downfall of long-form literature - Tufts Daily - November 21st, 2024 [November 21st, 2024]
- Silicon Valley billionaire warns 'absolutely there's a bubble' in AI valuations: 'Nobody would be surprised' if OpenAI 'disappeared next Monday' -... - November 21st, 2024 [November 21st, 2024]
- Advancing red teaming with people and AI - OpenAI - November 21st, 2024 [November 21st, 2024]
- Can Google Scholar survive the AI revolution? - Nature.com - November 21st, 2024 [November 21st, 2024]
- Nearly half of Gen AI adopters want it open source - here's why - ZDNet - November 21st, 2024 [November 21st, 2024]
- Founder of AI education chatbot charged with defrauding investors of $10 million - USA TODAY - November 21st, 2024 [November 21st, 2024]
- Microsoft at 50: An AI Giant. A Kinder Culture. And Still Hellbent on Domination - WIRED - November 21st, 2024 [November 21st, 2024]
- Matthew Libby on the dark underbelly of AI and his new play Data at Arena Stage - DC Theater Arts - November 21st, 2024 [November 21st, 2024]
- Cruise fesses up, Pony AI raises its IPO ambitions, and the TuSimple drama dials back up - TechCrunch - November 21st, 2024 [November 21st, 2024]
- I Called AI Santa Claus. He Hung Up On Me - The Daily Beast - November 21st, 2024 [November 21st, 2024]
- Nvidia says its Blackwell AI chip is full steam ahead - The Verge - November 21st, 2024 [November 21st, 2024]
- AI in drug discovery is nonsense, but call Schrdinger AI if you want, says CEO - STAT - November 21st, 2024 [November 21st, 2024]
- Is This a Sign That SoundHound AI Is Becoming a Safer Stock to Buy? - The Motley Fool - November 21st, 2024 [November 21st, 2024]
- Why the U.S. Launched an International Network of AI Safety Institutes - TIME - November 21st, 2024 [November 21st, 2024]
- Nvidias boss dismisses fears that AI has hit a wall - The Economist - November 21st, 2024 [November 21st, 2024]
- Will the bubble burst for AI in 2025, or will it start to deliver? - The Economist - November 21st, 2024 [November 21st, 2024]
- Thousands of AI agents later, who even remembers what they do? - The Register - November 21st, 2024 [November 21st, 2024]
- Child safety org flags new CSAM with AI trained on real child sex abuse images - Ars Technica - November 21st, 2024 [November 21st, 2024]
- Nvidias Sales Soar as AI Spending Boom Barrels Ahead - The Wall Street Journal - November 21st, 2024 [November 21st, 2024]
- How Oracle Got Its Mojo Back. What's Behind The AI Cloud Push Powering Its 80% Stock Gain. - Investor's Business Daily - November 21st, 2024 [November 21st, 2024]
- KPMG to spend $100 million on AI partnership with Google Cloud - Reuters - November 21st, 2024 [November 21st, 2024]
- Microsoft is the mystery AI company licensing HarperCollins books, says Bloomberg - The Verge - November 21st, 2024 [November 21st, 2024]
- How Students Can AI-Proof Their Careers - The Wall Street Journal - November 21st, 2024 [November 21st, 2024]
- The US Patent and Trademark Office Banned Staff From Using Generative AI - WIRED - November 21st, 2024 [November 21st, 2024]
- Wall Street strategists aren't relying on AI to drive the stock market rally anymore: Morning Brief - Yahoo Finance - November 19th, 2024 [November 19th, 2024]
- Move over chatbots, AI agents are the next big thing. What are they? - Quartz - November 19th, 2024 [November 19th, 2024]
- Meta AI Begins Roll Out on Ray-Ban Meta Glasses in France, Italy, Ireland and Spain - Meta - November 19th, 2024 [November 19th, 2024]
- Exclusive: Leaked Amazon documents identify critical flaws in the delayed AI reboot of Alexa - Fortune - November 19th, 2024 [November 19th, 2024]
- How Mark Zuckerberg went all-in to make Meta a major AI player and threaten OpenAIs dominance - Fortune - November 19th, 2024 [November 19th, 2024]
- AI maths assistant could help solve problems that humans are stuck on - New Scientist - November 19th, 2024 [November 19th, 2024]
- AI Is Now Co-Creator Of Our Collective Intelligence So Watch Your Back - Forbes - November 19th, 2024 [November 19th, 2024]
- Itching to write a book? AI publisher Spines wants to make a deal - TechCrunch - November 19th, 2024 [November 19th, 2024]
- AI is hitting a wall just as the hype around it reaches the stratosphere - CNN - November 19th, 2024 [November 19th, 2024]
- AI can learn to think before it speaks - Financial Times - November 19th, 2024 [November 19th, 2024]
- Can AI Robots Offer Advice That Heals Souls? - Religion Unplugged - November 19th, 2024 [November 19th, 2024]
- Crook breaks into AI biz, points $250K wire payment at their own account - The Register - November 19th, 2024 [November 19th, 2024]
- Symbotic Stock Rises 28%. Heres Why the AI-Robot Company Is Surging. - Barron's - November 19th, 2024 [November 19th, 2024]
- Leaked: Amazon held talks with Instacart, Uber, Ticketmaster, and others for help on its new AI-powered Alexa - Business Insider - November 19th, 2024 [November 19th, 2024]
- Got $3,000? 3 Artificial Intelligence (AI) Stocks to Buy and Hold for the Long Term - The Motley Fool - November 19th, 2024 [November 19th, 2024]
- Theres No Longer Any Doubt That Hollywood Writing Is Powering AI - The Atlantic - November 19th, 2024 [November 19th, 2024]
- Is AI making job applications easier, or creating another problem? - NBC News - November 19th, 2024 [November 19th, 2024]
- Microsoft announces its own Black Hat-like hacking event with big rewards for AI security - The Verge - November 19th, 2024 [November 19th, 2024]
- AI startup Perplexity adds shopping features as search competition tightens - Reuters - November 19th, 2024 [November 19th, 2024]
- Scientists Are Using AI To Improve Vegan Meat Alternatives - Plant Based News - November 19th, 2024 [November 19th, 2024]
- Microsofts new Copilot Actions use AI to automate repetitive tasks - The Verge - November 19th, 2024 [November 19th, 2024]