Archive for the ‘Machine Learning’ Category

Machine learning removes bias from algorithms and the hiring process – PRNewswire

Arena Analytics' Chief Data Scientist unveils a cutting edge technique that removes latent bias from algorithmic models.

Currently, the primary methods of reducing the impact of bias on models has been limited to adjusting input data or adjust models after-the-fact to ensure there is no disparate impact.

Recent reporting from the Wall Street Journal confirmed these as the most recent advances, concluding, "It's really up to the software engineers and leaders of the company to figure out how to fix it [or] go into the algorithm and tweak some of the main factors it considers in making its decisions."

For several years, Arena Analytics was also limited to these approaches, but that all changed 9 months ago. Up until then, Arena removed all data from the models that could correlate to protected classifications and then measured demographic parity.

"These efforts brought us in line with EEOC compliance thresholds - also known as the or 80% rule," explains Myra Norton, President/COO of Arena. "But we've always wanted to go further than a compliance threshold.We've wanted to surface a MORE diverse slate of candidates for every role in a client organization.And that's exactly what we've accomplished, now surpassing 95% in our representation of different classifications."

Chief Data Scientist Patrick Hagerty will explain at MLConf the way he and his team have leveraged techniques known asadversarial networks,an aspect of Generative Adversarial Networks (GAN's), tools that pit one algorithm against another.

"Arena's primary model predicts the outcomes our clients want, and Model Two is a Discriminator designed to predict a classification," says Hagerty. "The Discriminator attempts to detect the race, gender, background, and any other protected class data of a person. This causes the Predictor to adjust and optimize while eliminating correlations with the classifications the Discriminator is detecting."

Arena trained models to do this until achieving what's known as the Nash Equilibrium. This is the point at which the predictor and discriminator have reached peak optimization.

Arena's technology has helped industrious individuals find a variety of jobs - from RNs to medtechs, caregivers to cooks, concierge to security. Job candidates who Arena predicted for success include veterans with no prior experience in healthcare or senior/assisted living, recent high school graduates whose plans to work while attending college were up-ended, and former hospitality sector employees who decided to apply their dining service expertise to a new setting.

"We succeeded in our intent to reduce bias and diversify the workforce, but what surprised us was the impact this approach had on our core predictions. Data once considered unusable, such as commuting distance, we can now analyze because we've removed the potentially-associated protected-class-signal," says Michael Rosenbaum, Arena's founder and CEO. "As a result, our predictions are stronger AND we surface a more diverse slate of candidates across multiple spectrums. Our clients can now use their talent acquisition function to really support and lead out front on Diversity and Inclusion."

About Arena (https://www.arena.io/) applies predictive analytics and machine learning to solve talent acquisition challenges. Learning algorithms analyze a large amount of data topredict with high levels of accuracy the likelihood of different outcomes occurring, such as someone leaving, being engaged, having excellent attendance, and more. By revealing each individual's likely outcomes in specific positions, departments, and locations, Arena is transforming the labor market from one based on perception and unconscious bias, to one based on outcomes. Arena is currently growing dramatically within the healthcare and hospitality industry and expanding its offerings to other people intensive industries. For more information contact [emailprotected]arena.io

SOURCE Arena

https://www.arena.io/

Read more here:
Machine learning removes bias from algorithms and the hiring process - PRNewswire

Using machine learning to track the pandemic’s impact on mental health – MIT News

Dealing with a global pandemic has taken a toll on the mental health of millions of people. A team of MIT and Harvard University researchers has shown that they can measure those effects by analyzing the language that people use to express their anxiety online.

Using machine learning to analyze the text of more than 800,000 Reddit posts, the researchers were able to identify changes in the tone and content of language that people used as the first wave of the Covid-19 pandemic progressed, from January to April of 2020. Their analysis revealed several key changes in conversations about mental health, including an overall increase in discussion about anxiety and suicide.

We found that there were these natural clusters that emerged related to suicidality and loneliness, and the amount of posts in these clusters more than doubled during the pandemic as compared to the same months of the preceding year, which is a grave concern, says Daniel Low, a graduate student in the Program in Speech and Hearing Bioscience and Technology at Harvard and MIT and the lead author of the study.

The analysis also revealed varying impacts on people who already suffer from different types of mental illness. The findings could help psychiatrists, or potentially moderators of the Reddit forums that were studied, to better identify and help people whose mental health is suffering, the researchers say.

When the mental health needs of so many in our society are inadequately met, even at baseline, we wanted to bring attention to the ways that many people are suffering during this time, in order to amplify and inform the allocation of resources to support them, says Laurie Rumker, a graduate student in the Bioinformatics and Integrative Genomics PhD Program at Harvard and one of the authors of the study.

Satrajit Ghosh, a principal research scientist at MITs McGovern Institute for Brain Research, is the senior author of the study, which appears in the Journal of MedicalInternet Research. Other authors of the paper include Tanya Talkar, a graduate student in the Program in Speech and Hearing Bioscience and Technology at Harvard and MIT; John Torous, director of the digital psychiatry division at Beth Israel Deaconess Medical Center; and Guillermo Cecchi, a principal research staff member at the IBM Thomas J. Watson Research Center.

A wave of anxiety

The new study grew out of the MIT class 6.897/HST.956 (Machine Learning for Healthcare), in MITs Department of Electrical Engineering and Computer Science. Low, Rumker, and Talkar, who were all taking the course last spring, had done some previous research on using machine learning to detect mental health disorders based on how people speak and what they say. After the Covid-19 pandemic began, they decided to focus their class project on analyzing Reddit forums devoted to different types of mental illness.

When Covid hit, we were all curious whether it was affecting certain communities more than others, Low says. Reddit gives us the opportunity to look at all these subreddits that are specialized support groups. Its a really unique opportunity to see how these different communities were affected differently as the wave was happening, in real-time.

The researchers analyzed posts from 15 subreddit groups devoted to a variety of mental illnesses, including schizophrenia, depression, and bipolar disorder. They also included a handful of groups devoted to topics not specifically related to mental health, such as personal finance, fitness, and parenting.

Using several types of natural language processing algorithms, the researchers measured the frequency of words associated with topics such as anxiety, death, isolation, and substance abuse, and grouped posts together based on similarities in the language used. These approaches allowed the researchers to identify similarities between each groups posts after the onset of the pandemic, as well as distinctive differences between groups.

The researchers found that while people in most of the support groups began posting about Covid-19 in March, the group devoted to health anxiety started much earlier, in January. However, as the pandemic progressed, the other mental health groups began to closely resemble the health anxiety group, in terms of the language that was most often used. At the same time, the group devoted to personal finance showed the most negative semantic change from January to April 2020, and significantly increased the use of words related to economic stress and negative sentiment.

They also discovered that the mental health groups affected the most negatively early in the pandemic were those related to ADHD and eating disorders. The researchers hypothesize that without their usual social support systems in place, due to lockdowns, people suffering from those disorders found it much more difficult to manage their conditions. In those groups, the researchers found posts about hyperfocusing on the news and relapsing back into anorexia-type behaviors since meals were not being monitored by others due to quarantine.

Using another algorithm, the researchers grouped posts into clusters such as loneliness or substance use, and then tracked how those groups changed as the pandemic progressed. Posts related to suicide more than doubled from pre-pandemic levels, and the groups that became significantly associated with the suicidality cluster during the pandemic were the support groups for borderline personality disorder and post-traumatic stress disorder.

The researchers also found the introduction of new topics specifically seeking mental health help or social interaction. The topics within these subreddit support groups were shifting a bit, as people were trying to adapt to a new life and focus on how they can go about getting more help if needed, Talkar says.

While the authors emphasize that they cannot implicate the pandemic as the sole cause of the observed linguistic changes, they note that there was much more significant change during the period from January to April in 2020 than in the same months in 2019 and 2018, indicating the changes cannot be explained by normal annual trends.

Mental health resources

This type of analysis could help mental health care providers identify segments of the population that are most vulnerable to declines in mental health caused by not only the Covid-19 pandemic but other mental health stressors such as controversial elections or natural disasters, the researchers say.

Additionally, if applied to Reddit or other social media posts in real-time, this analysis could be used to offer users additional resources, such as guidance to a different support group, information on how to find mental health treatment, or the number for a suicide hotline.

Reddit is a very valuable source of support for a lot of people who are suffering from mental health challenges, many of whom may not have formal access to other kinds of mental health support, so there are implications of this work for ways that support within Reddit could be provided, Rumker says.

The researchers now plan to apply this approach to study whether posts on Reddit and other social media sites can be used to detect mental health disorders. One current project involves screening posts in a social media site for veterans for suicide risk and post-traumatic stress disorder.

The research was funded by the National Institutes of Health and the McGovern Institute.

Read the rest here:
Using machine learning to track the pandemic's impact on mental health - MIT News

The consistency of machine learning and statistical models in predicting clinical risks of individual patients – The BMJ – The BMJ

Now, imagine a machine learning system with an understanding of every detail of that persons entire clinical history and the trajectory of their disease. With the clinicians push of a button, such a system would be able to provide patient-specific predictions of expected outcomes if no treatment is provided to support the clinician and patient in making what may be life-or-death decisions[1] This would be a major achievement. The English NHS is currently investing 250 million in Artificial Intelligence (AI). Part of this AI work could help to identify patients most at risk of diseases such as heart disease or dementia, allowing for earlier diagnosis and cheaper, more focused, personalised prevention. [2] Multiple papers have suggested that machine learning outperforms statistical models including cardiovascular disease risk prediction. [3-6] We tested whether it is true with prediction of cardiovascular disease as exemplar.

Risk prediction models have been implemented worldwide into clinical practice to help clinicians make treatment decisions. As an example, guidelines by the UK National Institute for Health and Care Excellence recommend that statins are considered for patients with a predicted 10-year cardiovascular disease risk of 10% or more. [7] This is based on the estimation of QRISK which was derived using a statistical model. [8] Our research evaluated whether the predictions of cardiovascular disease risk for an individual patient would be similar if another model, such as a machine learning models were used, as different predictions could lead to different treatment decisions for a patient.

An electronic health record dataset was used for this study with similar risk factor information used across all models. Nineteen different prediction techniques were applied including 12 families of machine learning models (such as neural networks) and seven statistical models (such as Cox proportional hazards models). It was found that the various models had similar population-level model performance (C-statistics of about 0.87 and similar calibration). However, the predictions for individual CVD risks varied widely between and within different types of machine learning and statistical models, especially in patients with higher CVD risks. Most of the machine learning models, tested in this study, do not take censoring into account by default (i.e., loss to follow-up over the 10 years). This resulted in these models substantially underestimating cardiovascular disease risk.

The level of consistency within and between models should be assessed before they are used for treatment decisions making, as an arbitrary choice of technique and model could lead to a different treatment decision.

So, can a push of a button provide patient-specific risk prediction estimates by machine learning? Yes, it can. But should we use such estimates for patient-specific treatment-decision making if these predictions are model-dependant? Machine learning may be helpful in some areas of healthcare such as image recognition, and could be as useful as statistical models on population level prediction tasks. But in terms of predicting risk for individual decision making we think a lot more work could be done. Perhaps the claim that machine learning will revolutionise healthcare is a little premature.

Yan Li, doctoral student of statistical epidemiology, Health e-Research Centre, Health Data Research UK North, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester.

Matthew Sperrin, senior lecturer in health data science, Health e-Research Centre, Health Data Research UK North, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester.

Darren M Ashcroft, professor of pharmacoepidemiology, Centre for Pharmacoepidemiology and Drug Safety, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester.

Tjeerd Pieter van Staa, professor in health e-research, Health e-Research Centre, Health Data Research UK North, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester.

Competing interests: None declared.

References:

Continue reading here:
The consistency of machine learning and statistical models in predicting clinical risks of individual patients - The BMJ - The BMJ

Free Webinar | Machine Learning and Data Analytics in the Pandemic Era – MIT Sloan

Select your countryUnited StatesCanadaAfghanistanAlbaniaAlgeriaAmerican SamoaAndorraAngolaAntigua and BarbudaArgentinaArmeniaAustraliaAustriaAzerbaijanBahamasBahrainBangladeshBarbadosBelarusBelgiumBelizeBeninBermudaBhutanBoliviaBosnia and HerzegovinaBotswanaBrazilBruneiBulgariaBurkina FasoBurundiCambodiaCameroonCanadaCape VerdeCayman IslandsCentral African RepublicChadChileChinaColombiaComorosCongo, Democratic Republic of theCongo, Republic of theCosta RicaCte d'IvoireCroatiaCubaCyprusCzech RepublicDenmarkDjiboutiDominicaDominican RepublicEast TimorEcuadorEgyptEl SalvadorEquatorial GuineaEritreaEstoniaEthiopiaFaroe IslandsFijiFinlandFranceFrench PolynesiaGabonGambiaGeorgiaGermanyGhanaGreeceGreenlandGrenadaGuamGuatemalaGuineaGuinea-BissauGuyanaHaitiHondurasHong KongHungaryIcelandIndiaIndonesiaIranIraqIrelandIsraelItalyJamaicaJapanJordanKazakhstanKenyaKiribatiNorth KoreaSouth KoreaKosovoKuwaitKyrgyzstanLaosLatviaLebanonLesothoLiberiaLibyaLiechtensteinLithuaniaLuxembourgMacedoniaMadagascarMalawiMalaysiaMaldivesMaliMaltaMarshall IslandsMauritaniaMauritiusMexicoMicronesiaMoldovaMonacoMongoliaMontenegroMoroccoMozambiqueMyanmarNamibiaNauruNepalNetherlandsNew ZealandNicaraguaNigerNigeriaNorthern Mariana IslandsNorwayOmanPakistanPalauPalestine, State ofPanamaPapua New GuineaParaguayPeruPhilippinesPolandPortugalPuerto RicoQatarRomaniaRussiaRwandaSaint Kitts and NevisSaint LuciaSaint Vincent and the GrenadinesSamoaSan MarinoSao Tome and PrincipeSaudi ArabiaSenegalSerbiaSeychellesSierra LeoneSingaporeSint MaartenSlovakiaSloveniaSolomon IslandsSomaliaSouth AfricaSpainSri LankaSudanSudan, SouthSurinameSwazilandSwedenSwitzerlandSyriaTaiwanTajikistanTanzaniaThailandTogoTongaTrinidad and TobagoTunisiaTurkeyTurkmenistanTuvaluUgandaUkraineUnited Arab EmiratesUnited KingdomUnited StatesUruguayUzbekistanVanuatuVatican CityVenezuelaVietnamVirgin Islands, BritishVirgin Islands, U.S.YemenZambiaZimbabwe

Select your industryAd AgenciesAgricultureApparelAutomotiveBiotechnologyChemicalsConstructionConsultingConsumer GoodsEducationEnergyEngineeringEntertainmentEnvironmentalFinance & BankingFood & BeverageGovernmentHealth CareHospitalityInsuranceManufacturingMediaNot For ProfitRecreationRetailSecurityServicesTechnologyTelecommunicationsTransportationTravel and LeisureUtilitiesWholesaleOther (please specify)

Privacy Policy

By submitting this form to MIT SMR, you acknowledge that your name and contact information will be shared with SAS Institute Inc., which may contact you regarding the content.

By submitting this form to MIT SMR, you acknowledge that your name and contact information will be shared with SAS Institute Inc., which may contact you regarding the content.

This field is for validation purposes and should be left unchanged.

See the original post here:
Free Webinar | Machine Learning and Data Analytics in the Pandemic Era - MIT Sloan

Google Introduces New Analytics with Machine Learning and Predictive Models – IBL News

IBL News | New York

Google announcedthe introduction of its new Google Analytics with machine learning at its core, which is privacy-centric by design. They are built on the foundation of the App + Web propertypresentedlast year.

The goal of the giant searching company is to help users to get better ROI and improve their marketing decisions. It follows what a survey from Forrester Consulting points out that improving the use of analytics is a top priority for marketers.

The machine learning models include will allow the ability to alert on trends in data, like products seeing rising demand, and help to anticipate future actions from customers. For example, it calculates churn probability so you can more efficiently invest in retaining customers at a time when marketing budgets are under pressure, says in a blog-postVidhya Srinivasan,Vice President, Measurement, Analytics, and Buying Platforms at Google.

It also adds new predictive metrics indicating the potential revenue that can be earned from a particular group of customers. This allows you to create audiences to reach higher-value customers and run analyses to better understand why some customers are likely to spend more than others, so you can take action to improve your results, wroteVidhya Srinivasan.

The new Google Analytics providescustomer-centric measurement, including conversion from YouTube video views, Google and non-Google paid channels, search, social, and email. The setup works with or without cookies or identifiers.

They come by default for new web properties. In order toreplace the existing setup, Google encourages tocreate a new Google Analytics 4 property (previously called an App + Web property). Enterprise marketers are currently using a beta version with an Analytics 360 version with SLAs and advanced integrations with tools like BigQuery.

View post:
Google Introduces New Analytics with Machine Learning and Predictive Models - IBL News