Machine learning results: pay attention to what you don’t see – STAT
Even as machine learning and artificial intelligence are drawing substantial attention in health care, overzealousness for these technologies has created an environment in which other critical aspects of the research are often overlooked.
Theres no question that the increasing availability of large data sources and off-the-shelf machine learning tools offer tremendous resources to researchers. Yet a lack of understanding about the limitations of both the data and the algorithms can lead to erroneous or unsupported conclusions.
Given that machine learning in the health domain can have a direct impact on peoples lives, broad claims emerging from this kind of research should not be embraced without serious vetting. Whether conducting health care research or reading about it, make sure to consider what you dont see in the data and analyses.
advertisement
One key question to ask is: Whose information is in the data and what do these data reflect?
Common forms of electronic health data, such as billing claims and clinical records, contain information only on individuals who have encounters with the health care system. But many individuals who are sick dont or cant see a doctor or other health care provider and so are invisible in these databases. This may be true for individuals with lower incomes or those who live in rural communities with rising hospital closures. As University of Toronto machine learning professor Marzyeh Ghassemi said earlier this year:
Even among patients who do visit their doctors, health conditions are not consistently recorded. Health data also reflect structural racism, which has devastating consequences.
Data from randomized trials are not immune to these issues. As a ProPublica report demonstrated, black and Native American patients are drastically underrepresented in cancer clinical trials. This is important to underscore given that randomized trials are frequently highlighted as superior in discussions about machine learning work that leverages nonrandomized electronic health data.
In interpreting results from machine learning research, its important to be aware that the patients in a study often do not depict the population we wish to make conclusions about and that the information collected is far from complete.
It has become commonplace to evaluate machine learning algorithms based on overall measures like accuracy or area under the curve. However, one evaluation metric cannot capture the complexity of performance. Be wary of research that claims to be ready for translation into clinical practice but only presents a leader board of tools that are ranked based on a single metric.
As an extreme illustration, an algorithm designed to predict a rare condition found in only 1% of the population can be extremely accurate by labeling all individuals as not having the condition. This tool is 99% accurate, but completely useless. Yet, it may outperform other algorithms if accuracy is considered in isolation.
Whats more, algorithms are frequently not evaluated based on multiple hold-out samples in cross-validation. Using only a single hold-out sample, which is done in many published papers, often leads to higher variance and misleading metric performance.
Beyond examining multiple overall metrics of performance for machine learning, we should also assess how tools perform in subgroups as a step toward avoiding bias and discrimination. For example, artificial intelligence-based facial recognition software performed poorly when analyzing darker-skinned women. Many measures of algorithmic fairness center on performance in subgroups.
Bias in algorithms has largely not been a focus in health care research. That needs to change. A new study found substantial racial bias against black patients in a commercial algorithm used by many hospitals and other health care systems. Other work developed algorithms to improve fairness for subgroups in health care spending formulas.
Subjective decision-making pervades research. Who decides what the research question will be, which methods will be applied to answering it, and how the techniques will be assessed all matter. Diverse teams are needed not just because they yield better results. As Rediet Abebe, a junior fellow of Harvards Society of Fellows, has written, In both private enterprise and the public sector, research must be reflective of the society were serving.
The influx of so-called digital data thats available through search engines and social media may be one resource for understanding the health of individuals who do not have encounters with the health care system. There have, however, been notable failures with these data. But there are also promising advances using online search queries at scale where traditional approaches like conducting surveys would be infeasible.
Increasingly granular data are now becoming available thanks to wearable technologies such as Fitbit trackers and Apple Watches. Researchers are actively developing and applying techniques to summarize the information gleaned from these devices for prevention efforts.
Much of the published clinical machine learning research, however, focuses on predicting outcomes or discovering patterns. Although machine learning for causal questions in health and biomedicine is a rapidly growing area, we dont see a lot of this work yet because it is new. Recent examples of it include the comparative effectiveness of feeding interventions in a pediatric intensive care unit and the effectiveness of different types of drug-eluting coronary artery stents.
Understanding how the data were collected and using appropriate evaluation metrics will also be crucial for studies that incorporate novel data sources and those attempting to establish causality.
In our drive to improve health with (and without) machine learning, we must not forget to look for what is missing: What information do we not have about the underlying health care system? Why might an individual or a code be unobserved? What subgroups have not been prioritized? Who is on the research team?
Giving these questions a place at the table will be the only way to see the whole picture.
Sherri Rose, Ph.D., is associate professor of health care policy at Harvard Medical School and co-author of the first book on machine learning for causal inference, Targeted Learning (Springer, 2011).
See the article here:
Machine learning results: pay attention to what you don't see - STAT
- and correlation of drug solubility via hybrid machine learning and gradient based optimization - Nature - September 11th, 2025 [September 11th, 2025]
- Rice-Houston Methodist partnership uses machine learning to reveal hidden patient groups in common heart valve disease - Rice University - September 11th, 2025 [September 11th, 2025]
- Amazon Uses Machine Learning to Tell Sellers if FBA Is a Good Fit - EcommerceBytes - September 11th, 2025 [September 11th, 2025]
- Eli Lilly Launches AI, Machine Learning Platform Called TuneLab For Biotech Companies - Stocktwits - September 11th, 2025 [September 11th, 2025]
- How AI and Machine Learning are Shaping the Future of Mobile Apps - indiatechnologynews.in - September 11th, 2025 [September 11th, 2025]
- Hybrid AI and semiconductor approaches for power quality improvement - Machine Learning Week 2025 - September 9th, 2025 [September 9th, 2025]
- The Predictive Turn | Preparing to Outthink Adversaries Through Predictive Analytics - Machine Learning Week 2025 - September 9th, 2025 [September 9th, 2025]
- NFL player props, odds and bets: Week 1, 2025 NFL picks, SportsLine Machine Learning Model AI predictions, SGP - CBS Sports - September 9th, 2025 [September 9th, 2025]
- Can machine learning forecast Lobo EV Technologies Ltd. recovery - Bear Alert & Daily Price Action Insights - Newser - September 6th, 2025 [September 6th, 2025]
- Generalised Machine Learning Models Outperform Personalised Models For Cognitive Load Classification In Real-Life Settings - Frontiers - September 6th, 2025 [September 6th, 2025]
- Machine learning for the prediction of blood transfusion risk during or after mitral valve surgery: a multicenter retrospective cohort study - Nature - September 6th, 2025 [September 6th, 2025]
- Machine Learning-Driven Exploration of Composition- and Temperature-Dependent Transport and Thermodynamic Properties in LiF-NaF-KF Molten Salts for... - September 6th, 2025 [September 6th, 2025]
- Machine learning analysis reveals tumor heterogeneity and stromal-immune niches in breast cancer - Nature - September 6th, 2025 [September 6th, 2025]
- Identification of Postoperative Weight Loss Trajectories and Development of a Machine Learning-Based Tool for Predicting Malnutrition in Gastric... - September 6th, 2025 [September 6th, 2025]
- The Relationship Between Number of Pregnancies and Serum 25-Hydroxyvitamin D Levels in Women with a Prior Pregnancy: A Cross - Sectional Analysis,... - September 6th, 2025 [September 6th, 2025]
- Tohoku University Researchers Use Machine Learning to Identify Factors Improving Nickel-Based Catalysts for CO Methanation - geneonline.com - September 6th, 2025 [September 6th, 2025]
- Combining machine learning predictions for Galaxy Payroll Group Limited - Quarterly Growth Report & AI Forecast Swing Trade Picks - Newser - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast CLSKW recovery - 2025 Breakouts & Breakdowns & Daily Profit Maximizing Trade Tips - Newser - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast Granite Real Estate Investment Trust recovery - July 2025 Spike Watch & Growth Focused Stock Reports - Newser - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast VERU recovery - July 2025 Intraday Action & AI Forecasted Entry/Exit Points - Newser - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast VCI Global Limited recovery - Market Rally & Expert-Curated Trade Recommendations - Newser - September 5th, 2025 [September 5th, 2025]
- Combining machine learning predictions for AutoNation Inc. - Weekly Trend Summary & Weekly Breakout Watchlists - Newser - September 5th, 2025 [September 5th, 2025]
- Combining machine learning predictions for PLXS - Options Play & Fast Gain Stock Trading Tips - Newser - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast Valens Semiconductor Ltd. recovery - July 2025 Action & Free Growth Oriented Trading Recommendations - Newser - September 5th, 2025 [September 5th, 2025]
- Improve cost visibility of Machine Learning workloads on Amazon EKS with AWS Split Cost Allocation Data - Amazon Web Services - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast LFT.PRA recovery - Weekly Trade Recap & Daily Profit Maximizing Trade Tips - Newser - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast TEAM recovery - 2025 Pullback Review & Free Weekly Chart Analysis and Trade Guides - Newser - September 5th, 2025 [September 5th, 2025]
- Combining machine learning predictions for MSBIP - Weekly Profit Analysis & AI Powered Market Entry Strategies - Newser - September 5th, 2025 [September 5th, 2025]
- Revolutionizing Antibody Discovery with Machine Learning - BIOENGINEER.ORG - September 5th, 2025 [September 5th, 2025]
- The good and bad of machine learning | Letters - The Guardian - September 3rd, 2025 [September 3rd, 2025]
- I'm a machine learning engineer at Amazon who anticipated the ML boom. Here's my advice for staying ahead. - AOL.com - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for Dogwood Therapeutics Inc. - July 2025 Breakouts & Weekly Setup with High ROI Potential - Newser - September 3rd, 2025 [September 3rd, 2025]
- Phenotyping valvular heart diseases using the lens of unsupervised machine learning: a scoping review - Nature - September 3rd, 2025 [September 3rd, 2025]
- Students use machine learning to track and protect whale populations - Technology Org - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for Triller Group Inc. Equity Warrant - Gap Up & Weekly High Conviction Ideas - Newser - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for DallasNews Corporation - Quarterly Trade Report & Technical Entry and Exit Tips - Newser - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for System1 Inc. - Weekly Gains Summary & Risk Adjusted Swing Trade Ideas - Newser - September 3rd, 2025 [September 3rd, 2025]
- Unlocking the impossible without compromising on creative control: iZotope Ozone 12 adds new machine learning modules and a more musician-friendly AI... - September 3rd, 2025 [September 3rd, 2025]
- What machine learning models say about SLND.WS - Quarterly Trade Report & Technical Entry and Exit Tips - Newser - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for Chemed Corporation - Weekly Stock Recap & Growth Focused Entry Reports - Newser - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for TAP.A - Earnings Growth Report & Entry Point Confirmation Alerts - Newser - September 3rd, 2025 [September 3rd, 2025]
- Bridging known and unknown dynamics by transformer-based machine-learning inference from sparse observations - Nature - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for Inseego Corp. - July 2025 Retail & Technical Confirmation Trade Alerts - Newser - September 3rd, 2025 [September 3rd, 2025]
- Can machine learning forecast Aditxt Inc. recovery - July 2025 Update & Expert Curated Trade Ideas - Newser - September 3rd, 2025 [September 3rd, 2025]
- I'm a machine learning engineer at Amazon who anticipated the ML boom. Here's my advice for staying ahead. - Business Insider - September 1st, 2025 [September 1st, 2025]
- Machine learning climbs the Jacobs Ladder of optoelectronic properties - Nature - September 1st, 2025 [September 1st, 2025]
- Predicting factors associated with anxiety by patients undergoing treatment for infectious diseases using a random-forest machine learning approach -... - September 1st, 2025 [September 1st, 2025]
- Hideo Kojima used "an AI machine learning rig" to painstakingly download his celebrity friends to Death Stranding 2, but he wasn't happy... - September 1st, 2025 [September 1st, 2025]
- Fibro predict a machine learning risk score for advanced liver fibrosis in the general population using Israeli electronic health records - Nature - September 1st, 2025 [September 1st, 2025]
- Machine learning for preventing stillbirths: is it possible to transform data into life-saving insights? - BMC Pregnancy and Childbirth - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Kura Sushi USA Inc. recovery - 2025 Fundamental Recap & AI Based Buy and Sell Signals - Newser - September 1st, 2025 [September 1st, 2025]
- Combining machine learning predictions for China Liberal Education Holdings Limited - Weekly Profit Recap & Weekly Breakout Watchlists - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Tyson Foods Inc. recovery - 2025 Trade Ideas & Smart Swing Trading Techniques - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast GLBZ recovery - July 2025 Movers & AI Based Buy and Sell Signals - Newser - September 1st, 2025 [September 1st, 2025]
- What machine learning models say about Sypris Solutions Inc. - Market Performance Recap & Real-Time Volume Trigger Notifications - Newser - September 1st, 2025 [September 1st, 2025]
- What machine learning models say about Astria Therapeutics Inc. - July 2025 News Drivers & Real-Time Buy Signal Alerts - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast CRTO recovery - July 2025 Analyst Calls & Growth Focused Investment Plans - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Exelon Corporation recovery - Exit Point & Pattern Based Trade Signal System - Newser - September 1st, 2025 [September 1st, 2025]
- What machine learning models say about OFIX - Bond Market & Long-Term Safe Investment Plans - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Beneficient recovery - Weekly Trade Recap & Breakout Confirmation Alerts - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast BTBDW recovery - 2025 Geopolitical Influence & Weekly High Momentum Picks - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Tri Pointe Homes Inc. recovery - July 2025 WrapUp & Free Long-Term Investment Growth Plans - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast TeraWulf Inc. recovery - Market Movement Recap & Community Supported Trade Ideas - Newser - September 1st, 2025 [September 1st, 2025]
- Combining machine learning predictions for Alset Inc. - 2025 Technical Patterns & Precise Buy Zone Identification - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Exelon Corporation recovery - 2025 Bull vs Bear & Smart Allocation Stock Reports - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Token Cat Limited Depositary Receipt recovery - 2025 Price Action Summary & Breakout Confirmation Alerts - Newser - September 1st, 2025 [September 1st, 2025]
- Combining machine learning predictions for BT Brands Inc. - Market Performance Recap & Verified Technical Trade Signals - Newser - September 1st, 2025 [September 1st, 2025]
- 7 Beginner Machine Learning Projects To Complete This Weekend - KDnuggets - August 29th, 2025 [August 29th, 2025]
- Machine learning approaches for predicting the construction time of drill-and-blast tunnels - Nature - August 29th, 2025 [August 29th, 2025]
- Combining machine learning predictions for KKR.PRD - July 2025 Closing Moves & Technical Pattern Recognition Alerts - Newser - August 29th, 2025 [August 29th, 2025]
- Leveraging data analytics to revolutionize cybersecurity with machine learning and deep learning - Nature - August 29th, 2025 [August 29th, 2025]
- Can machine learning forecast Yext Inc. recovery - Earnings Performance Report & Accurate Buy Signal Notifications - Newser - August 29th, 2025 [August 29th, 2025]
- Combining machine learning predictions for Mercer International Inc. - July 2025 Highlights & Real-Time Volume Analysis - Newser - August 29th, 2025 [August 29th, 2025]
- Combining machine learning predictions for Kandal M Venture Limited - Inflation Watch & Verified Technical Signals - Newser - August 29th, 2025 [August 29th, 2025]
- Combining machine learning predictions for Asbury Automotive Group Inc. - July 2025 Intraday Action & Daily Volume Surge Signals - Newser - August 29th, 2025 [August 29th, 2025]
- Can machine learning forecast NINE recovery - Quarterly Performance Summary & Technical Entry and Exit Tips - Newser - August 29th, 2025 [August 29th, 2025]
- IQUP identifies quantitatively unreliable spectra with machine learning for isobaric labeling-based proteomics - Nature - August 29th, 2025 [August 29th, 2025]
- Can machine learning forecast HealthEquity Inc. recovery - Exit Point & High Accuracy Buy Signal Tips - Newser - August 29th, 2025 [August 29th, 2025]
- Machine learning-based prediction of unconfined compressive strength of organic-rich clay shales using hybrid destructive and non-destructive inputs -... - August 29th, 2025 [August 29th, 2025]
- Can machine learning forecast WFC.PRL recovery - Market Volume Summary & Smart Investment Allocation Insights - Newser - August 29th, 2025 [August 29th, 2025]