Researchers find AI is bad at predicting GPA, grit, eviction, job training, layoffs, and material hardship – VentureBeat
A paper coauthored by over 112 researchers across 160 data and social science teams found that AI and statistical models, when used to predict six life outcomes for children, parents, and households, werent very accurate even when trained on 13,000 data points from over 4,000 families. They assert that the work is a cautionary tale on the use of predictive modeling, especially in the criminal justice system and social support programs.
Heres a setting where we have hundreds of participants and a rich data set, and even the best AI results are still not accurate, said study co-lead author Matt Salganik, a professor of sociology at Princeton and interim director of the Center for Information Technology Policy at the Woodrow Wilson School of Public and International Affairs. These results show us that machine learning isnt magic; there are clearly other factors at play when it comes to predicting the life course.
The study, which was published this week in the journal Proceedings of the National Academy of Sciences, is the fruit of the Fragile Families Challenge, a multi-year collaboration that sought to recruit researchers to complete a predictive task by predicting the same outcomes using the same data. Over 457 groups applied, of which 160 were selected to participate, and their predictions were evaluated with an error metric that assessed their ability to predict held-out data (i.e., data held by the organizer and not available to the participants).
The Challenge was an outgrowth of the Fragile Families Study (formerly Fragile Families and Child Wellbeing Study) based at Princeton, Columbia University, and the University of Michigan, which has been studying a cohort of about 5,000 children born in 20 large American cities between 1998 and 2000. Its designed to oversample births to unmarried couples in those cities, and to address four questions of interest to researchers and policymakers:
When we began, I really didnt know what a mass collaboration was, but I knew it would be a good idea to introduce our data to a new group of researchers: data scientists, said Sara McLanahan, the William S. Tod Professor of Sociology and Public Affairs at Princeton. The results were eye-opening.
The Fragile Families Study data set consists of modules, each of which is made up of roughly 10 sections, where each section includes questions about a topic asked of the childrens parents, caregivers, teachers, and the children themselves. For example, a mother who recently gave birth might be asked about relationships with extended kin, government programs, and marriage attitudes, while a 9-year-old child might be asked about parental supervision, sibling relationships, and school. In addition to the surveys, the corpus contains the results of in-home assessments, including psychometric testing, biometric measurements, and observations of neighborhoods and homes.
The goal of the Challenge was to predict the social outcomes of children aged 15 years, which encompasses 1,617 variables. From the variables, six were selected to be the focus:
Contributing researchers were provided anonymized background data from 4,242 families and 12,942 variables about each family, as well as training data incorporating the six outcomes for half of the families. Once the Challenge was completed, all 160 submissions were scored using the holdout data.
In the end, even the best of the over 3,000 models submitted which often used complex AI methods and had access to thousands of predictor variables werent spot on. In fact, they were only marginally better than linear regression and logistic regression, which dont rely on any form of machine learning.
Either luck plays a major role in peoples lives, or our theories as social scientists are missing some important variable, added McLanahan. Its too early at this point to know for sure.
Measured by the coefficient of determination, or the correlation of the best models predictions with the ground truth data, material hardship i.e., whether 15-year-old childrens parents suffered financial issues was .23, or 23% accuracy. GPA predictions were 0.19 (19%), while grit, eviction, job training, and layoffs were 0.06 (6%), 0.05 (5%), and 0.03 (3%), respectively.
The results raise questions about the relative performance of complex machine-learning models compared with simple benchmark models. In the Challenge, the simple benchmark model with only a few predictors was only slightly worse than the most accurate submission, and it actually outperformed many of the submissions, concluded the studys coauthors. Therefore, before using complex predictive models, we recommend that policymakers determine whether the achievable level of predictive accuracy is appropriate for the setting where the predictions will be used, whether complex models are more accurate than simple models or domain experts in their setting, and whether possible improvement in predictive performance is worth the additional costs to create, test, and understand the more complex model.
The research team is currently applying for grants to continue studies in this area, and theyve also published 12 of the teams results in a special issue of a journal called Socius, a new open-access journal from the American Sociological Association. In order to support additional research, all the submissions to the Challenge including the code, predictions, and narrative explanations will be made publicly available.
The Challenge isnt the first to expose the predictive shortcomings of AI and machine learning models. The Partnership on AI, a nonprofit coalition committed to the responsible use of AI, concluded in its first-ever report last year that algorithms are unfit to automate the pre-trial bail process or label some people as high-risk and detain them. The use of algorithms in decision making for judges has been known to produce race-based unfair results that are more likely to label African-American inmates as at risk of recidivism.
Its well-understood that AI has a bias problem. For instance, word embedding, a common algorithmic training technique that involves linking words to vectors, unavoidably picks up and at worst amplifies prejudices implicit in source text and dialogue. A recent study by the National Institute of Standards and Technology (NIST) found that many facial recognition systems misidentify people of color more often than Caucasian faces. And Amazons internal recruitment tool which was trained on resumes submitted over a 10-year period was reportedly scrapped because it showed bias against women.
A number of solutions have been proposed, from algorithmic tools to services that detect bias by crowdsourcing large training data sets.
In June 2019, working with experts in AI fairness, Microsoft revised and expanded the data sets it uses to train Face API, a Microsoft Azure API that provides algorithms for detecting, recognizing, and analyzing human faces in images. Last May, Facebook announced Fairness Flow, which automatically sends a warning if an algorithm is making an unfair judgment about a person based on their race, gender, or age. Google recently released the What-If Tool, a bias-detecting feature of the TensorBoard web dashboard for its TensorFlow machine learning framework. Not to be outdone, IBM last fall released AI Fairness 360, a cloud-based, fully automated suite that continually provides [insights] into how AI systems are making their decisions and recommends adjustments such as algorithmic tweaks or counterbalancing data that might lessen the impact of prejudice.
Continued here:
Researchers find AI is bad at predicting GPA, grit, eviction, job training, layoffs, and material hardship - VentureBeat
- 5 Key Takeaways | The Law of the Machine (Learning): Solving Complex AI Challenges - JD Supra - September 17th, 2025 [September 17th, 2025]
- Spectral and Machine Learning Approach Enhances Efficiency of Grape Embryo Rescue | Newswise - Newswise - September 17th, 2025 [September 17th, 2025]
- Helpful Reminders for Patent Eligibility of AI, Machine Learning, and Other Software-Related Inventions - JD Supra - September 17th, 2025 [September 17th, 2025]
- Opening the black box of machine learning-controlled plasma treatments - AIP.ORG - September 17th, 2025 [September 17th, 2025]
- Post-compilation Circuit Scaling for Quantum Machine Learning Models Reveals Resource Trends and Topology Impacts - Quantum Zeitgeist - September 17th, 2025 [September 17th, 2025]
- Machine-learning tool gives doctors a more detailed 3D picture of fetal health - Medical Xpress - September 17th, 2025 [September 17th, 2025]
- Portable Electronic Nose with Machine Learning Enhances VOC Detection in Forensic Science - Chromatography Online - September 15th, 2025 [September 15th, 2025]
- Developing a predictive model for breast cancer detection using radiomics-based mammography and machine learning - SpringerOpen - September 13th, 2025 [September 13th, 2025]
- and correlation of drug solubility via hybrid machine learning and gradient based optimization - Nature - September 11th, 2025 [September 11th, 2025]
- Rice-Houston Methodist partnership uses machine learning to reveal hidden patient groups in common heart valve disease - Rice University - September 11th, 2025 [September 11th, 2025]
- Amazon Uses Machine Learning to Tell Sellers if FBA Is a Good Fit - EcommerceBytes - September 11th, 2025 [September 11th, 2025]
- Eli Lilly Launches AI, Machine Learning Platform Called TuneLab For Biotech Companies - Stocktwits - September 11th, 2025 [September 11th, 2025]
- How AI and Machine Learning are Shaping the Future of Mobile Apps - indiatechnologynews.in - September 11th, 2025 [September 11th, 2025]
- Hybrid AI and semiconductor approaches for power quality improvement - Machine Learning Week 2025 - September 9th, 2025 [September 9th, 2025]
- The Predictive Turn | Preparing to Outthink Adversaries Through Predictive Analytics - Machine Learning Week 2025 - September 9th, 2025 [September 9th, 2025]
- NFL player props, odds and bets: Week 1, 2025 NFL picks, SportsLine Machine Learning Model AI predictions, SGP - CBS Sports - September 9th, 2025 [September 9th, 2025]
- Can machine learning forecast Lobo EV Technologies Ltd. recovery - Bear Alert & Daily Price Action Insights - Newser - September 6th, 2025 [September 6th, 2025]
- Generalised Machine Learning Models Outperform Personalised Models For Cognitive Load Classification In Real-Life Settings - Frontiers - September 6th, 2025 [September 6th, 2025]
- Machine learning for the prediction of blood transfusion risk during or after mitral valve surgery: a multicenter retrospective cohort study - Nature - September 6th, 2025 [September 6th, 2025]
- Machine Learning-Driven Exploration of Composition- and Temperature-Dependent Transport and Thermodynamic Properties in LiF-NaF-KF Molten Salts for... - September 6th, 2025 [September 6th, 2025]
- Machine learning analysis reveals tumor heterogeneity and stromal-immune niches in breast cancer - Nature - September 6th, 2025 [September 6th, 2025]
- Identification of Postoperative Weight Loss Trajectories and Development of a Machine Learning-Based Tool for Predicting Malnutrition in Gastric... - September 6th, 2025 [September 6th, 2025]
- The Relationship Between Number of Pregnancies and Serum 25-Hydroxyvitamin D Levels in Women with a Prior Pregnancy: A Cross - Sectional Analysis,... - September 6th, 2025 [September 6th, 2025]
- Tohoku University Researchers Use Machine Learning to Identify Factors Improving Nickel-Based Catalysts for CO Methanation - geneonline.com - September 6th, 2025 [September 6th, 2025]
- Combining machine learning predictions for Galaxy Payroll Group Limited - Quarterly Growth Report & AI Forecast Swing Trade Picks - Newser - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast CLSKW recovery - 2025 Breakouts & Breakdowns & Daily Profit Maximizing Trade Tips - Newser - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast Granite Real Estate Investment Trust recovery - July 2025 Spike Watch & Growth Focused Stock Reports - Newser - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast VERU recovery - July 2025 Intraday Action & AI Forecasted Entry/Exit Points - Newser - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast VCI Global Limited recovery - Market Rally & Expert-Curated Trade Recommendations - Newser - September 5th, 2025 [September 5th, 2025]
- Combining machine learning predictions for AutoNation Inc. - Weekly Trend Summary & Weekly Breakout Watchlists - Newser - September 5th, 2025 [September 5th, 2025]
- Combining machine learning predictions for PLXS - Options Play & Fast Gain Stock Trading Tips - Newser - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast Valens Semiconductor Ltd. recovery - July 2025 Action & Free Growth Oriented Trading Recommendations - Newser - September 5th, 2025 [September 5th, 2025]
- Improve cost visibility of Machine Learning workloads on Amazon EKS with AWS Split Cost Allocation Data - Amazon Web Services - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast LFT.PRA recovery - Weekly Trade Recap & Daily Profit Maximizing Trade Tips - Newser - September 5th, 2025 [September 5th, 2025]
- Can machine learning forecast TEAM recovery - 2025 Pullback Review & Free Weekly Chart Analysis and Trade Guides - Newser - September 5th, 2025 [September 5th, 2025]
- Combining machine learning predictions for MSBIP - Weekly Profit Analysis & AI Powered Market Entry Strategies - Newser - September 5th, 2025 [September 5th, 2025]
- Revolutionizing Antibody Discovery with Machine Learning - BIOENGINEER.ORG - September 5th, 2025 [September 5th, 2025]
- The good and bad of machine learning | Letters - The Guardian - September 3rd, 2025 [September 3rd, 2025]
- I'm a machine learning engineer at Amazon who anticipated the ML boom. Here's my advice for staying ahead. - AOL.com - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for Dogwood Therapeutics Inc. - July 2025 Breakouts & Weekly Setup with High ROI Potential - Newser - September 3rd, 2025 [September 3rd, 2025]
- Phenotyping valvular heart diseases using the lens of unsupervised machine learning: a scoping review - Nature - September 3rd, 2025 [September 3rd, 2025]
- Students use machine learning to track and protect whale populations - Technology Org - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for Triller Group Inc. Equity Warrant - Gap Up & Weekly High Conviction Ideas - Newser - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for DallasNews Corporation - Quarterly Trade Report & Technical Entry and Exit Tips - Newser - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for System1 Inc. - Weekly Gains Summary & Risk Adjusted Swing Trade Ideas - Newser - September 3rd, 2025 [September 3rd, 2025]
- Unlocking the impossible without compromising on creative control: iZotope Ozone 12 adds new machine learning modules and a more musician-friendly AI... - September 3rd, 2025 [September 3rd, 2025]
- What machine learning models say about SLND.WS - Quarterly Trade Report & Technical Entry and Exit Tips - Newser - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for Chemed Corporation - Weekly Stock Recap & Growth Focused Entry Reports - Newser - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for TAP.A - Earnings Growth Report & Entry Point Confirmation Alerts - Newser - September 3rd, 2025 [September 3rd, 2025]
- Bridging known and unknown dynamics by transformer-based machine-learning inference from sparse observations - Nature - September 3rd, 2025 [September 3rd, 2025]
- Combining machine learning predictions for Inseego Corp. - July 2025 Retail & Technical Confirmation Trade Alerts - Newser - September 3rd, 2025 [September 3rd, 2025]
- Can machine learning forecast Aditxt Inc. recovery - July 2025 Update & Expert Curated Trade Ideas - Newser - September 3rd, 2025 [September 3rd, 2025]
- I'm a machine learning engineer at Amazon who anticipated the ML boom. Here's my advice for staying ahead. - Business Insider - September 1st, 2025 [September 1st, 2025]
- Machine learning climbs the Jacobs Ladder of optoelectronic properties - Nature - September 1st, 2025 [September 1st, 2025]
- Predicting factors associated with anxiety by patients undergoing treatment for infectious diseases using a random-forest machine learning approach -... - September 1st, 2025 [September 1st, 2025]
- Hideo Kojima used "an AI machine learning rig" to painstakingly download his celebrity friends to Death Stranding 2, but he wasn't happy... - September 1st, 2025 [September 1st, 2025]
- Fibro predict a machine learning risk score for advanced liver fibrosis in the general population using Israeli electronic health records - Nature - September 1st, 2025 [September 1st, 2025]
- Machine learning for preventing stillbirths: is it possible to transform data into life-saving insights? - BMC Pregnancy and Childbirth - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Kura Sushi USA Inc. recovery - 2025 Fundamental Recap & AI Based Buy and Sell Signals - Newser - September 1st, 2025 [September 1st, 2025]
- Combining machine learning predictions for China Liberal Education Holdings Limited - Weekly Profit Recap & Weekly Breakout Watchlists - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Tyson Foods Inc. recovery - 2025 Trade Ideas & Smart Swing Trading Techniques - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast GLBZ recovery - July 2025 Movers & AI Based Buy and Sell Signals - Newser - September 1st, 2025 [September 1st, 2025]
- What machine learning models say about Sypris Solutions Inc. - Market Performance Recap & Real-Time Volume Trigger Notifications - Newser - September 1st, 2025 [September 1st, 2025]
- What machine learning models say about Astria Therapeutics Inc. - July 2025 News Drivers & Real-Time Buy Signal Alerts - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast CRTO recovery - July 2025 Analyst Calls & Growth Focused Investment Plans - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Exelon Corporation recovery - Exit Point & Pattern Based Trade Signal System - Newser - September 1st, 2025 [September 1st, 2025]
- What machine learning models say about OFIX - Bond Market & Long-Term Safe Investment Plans - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Beneficient recovery - Weekly Trade Recap & Breakout Confirmation Alerts - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast BTBDW recovery - 2025 Geopolitical Influence & Weekly High Momentum Picks - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Tri Pointe Homes Inc. recovery - July 2025 WrapUp & Free Long-Term Investment Growth Plans - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast TeraWulf Inc. recovery - Market Movement Recap & Community Supported Trade Ideas - Newser - September 1st, 2025 [September 1st, 2025]
- Combining machine learning predictions for Alset Inc. - 2025 Technical Patterns & Precise Buy Zone Identification - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Exelon Corporation recovery - 2025 Bull vs Bear & Smart Allocation Stock Reports - Newser - September 1st, 2025 [September 1st, 2025]
- Can machine learning forecast Token Cat Limited Depositary Receipt recovery - 2025 Price Action Summary & Breakout Confirmation Alerts - Newser - September 1st, 2025 [September 1st, 2025]
- Combining machine learning predictions for BT Brands Inc. - Market Performance Recap & Verified Technical Trade Signals - Newser - September 1st, 2025 [September 1st, 2025]
- 7 Beginner Machine Learning Projects To Complete This Weekend - KDnuggets - August 29th, 2025 [August 29th, 2025]
- Machine learning approaches for predicting the construction time of drill-and-blast tunnels - Nature - August 29th, 2025 [August 29th, 2025]
- Combining machine learning predictions for KKR.PRD - July 2025 Closing Moves & Technical Pattern Recognition Alerts - Newser - August 29th, 2025 [August 29th, 2025]
- Leveraging data analytics to revolutionize cybersecurity with machine learning and deep learning - Nature - August 29th, 2025 [August 29th, 2025]
- Can machine learning forecast Yext Inc. recovery - Earnings Performance Report & Accurate Buy Signal Notifications - Newser - August 29th, 2025 [August 29th, 2025]