Archive for the ‘Machine Learning’ Category

Accurate and rapid antibiotic susceptibility testing using a machine learning-assisted nanomotion technology platform – Nature.com

All experimental procedures and bioinformatic analyses in this work comply with ethical regulations and good scientific practices. An ethics approval for the pre-clinical experiments was not required as anonymized biological material, i.e., anonymized blood for the blood culture incubation, was provided by a blood donation center in Switzerland. The clinical study protocol for the PHENOTECH-1 study (NCT05613322) was approved by the Ethics Committee for Investigation with Medicinal Products (CEIm) in Madrid (ID 239/22), the Cantonal Commission for Ethics in Research on Human Beings (CER-VD) in Lausanne (ID 2022-02085), and the Ethics Committee of the Medical University of Innsbruck in Innsbruck (ID 1271/2022).

The strain collection used in this study consists of ATCC reference strains and clinical isolates either from patient blood samples at hospital sites or procured from strain collections (Supplementary Data1). In order to establish a methodology for nanomotion-based AST, we used the E. coli reference strain ATCC-25922, which is susceptible to ceftriaxone (CRO; ceftriaxone disodium salt hemi(heptahydrate) analytical standard, Merck & Cie, Schaffhausen, Switzerland), cefotaxime (CTX; cefotaxime sodium, Pharmaceutical Secondary Standard, Supelco, Merck & Cie, Schaffhausen, Switzerland), ciprofloxacin (CIP; ciprofloxacin, VETRANAL, analytical standard, Merck & Cie, Schaffhausen, Switzerland), and ceftazidime-avibactam (SigmaAldrich, Merck & Cie, Schaffhausen, Switzerland). Our reference strains for antibiotic resistance were BAA-2452 (resistant to CRO and CTX, blaNDM producer) and BAA-2469 (resistant to CIP). The K. pneumoniae reference isolates ATCC-27736 was susceptible to CRO.

To differentiate between resistant and susceptible phenotypes, clinical isolates were selected based on their MIC in accordance with the European Committee on Antimicrobial Susceptibility Testing (EUCAST) interpretation guidelines59. MIC strips and disk diffusion tests were performed on MH Agar plates (Mueller-Hinton agar VWR International GmbH, Dietikon, Switzerland). During all nanomotion experiments, bacteria in the measurement chamber were incubated with filtered (0.02 m Polyethersulfone, PES, Corning, or Millipore) LB (Millers LB Broth, Corning) half-diluted in deionized water (Molecular Biology Grade Water, Cytiva), hereafter referred to as 50% LB.

All bacterial strains were stored at 80C in 20% glycerol. Bacterial samples for nanomotion experiments were prepared by first thawing new cell aliquots and growing them at 37C on Columbia agar medium solid plates (Columbia blood Agar, 5% sheep blood, VWR International GmbH, Dietikon, Switzerland). These cells were then used to inoculate blood culture medium and subsequently grown for nanomotion experimentation.

We performed MIC gradient tests (MIC strips) to determine the minimal inhibitory concentration (MIC) for each antibiotic used in this study. Cell suspensions were prepared by selecting three to five colonies grown overnight (ON) at 37C on a Columbia agar plate and resuspending them in 0.9% NaCl solution (Sodium Chloride, 0.9%, HuberLab, PanReac Applichem) at a density of 0.5 McFarland units (corresponding to OD600nm=0.07). This suspension was then spread on MH plates using a sterile cotton swab to create a confluent culture. MIC strips (ceftriaxone 0.016256g/mL, ciprofloxacin 0.00232g/mL, cefotaxime 0.016256g/mL, ceftazidime 0.016256g/mL, and ceftazidime-avibactam 0.016/4256/4g/mL MIC test strips, Liofilchem, Roseto degli Abruzzi, Teramo, Italy) were then placed onto inoculated plates using tweezers. The plates were subsequently incubated at 37C for 1620h, with the growth inhibition area surrounding the MIC strip present after this incubation period used to interpret MICs.

While MIC strips served as the primary AST reference method, some situations presented difficult interpretations or exceeded the scale of the CRO MIC strips. Here, broth microdilution assays were performed according to EUCAST recommendations59. Furthermore, a disk diffusion assay (DDA) was performed in parallel to each sample assessed using nanomotion technology for quality assurance purposes20,60.

To facilitate bacterial attachment and prevent cellular detachment during AST recording, we incubated the cantilever with 50l of 0.1mg/ml PDL (Poly-D-Lysine hydrobromide, MP Biomedicals, Santa Ana, California, USA) diluted in molecular biology grade water (HyClone, Logan, Utah, United States) for 20min at room temperature (RT). This treatment created a homogenous positive electric charge that enabled the attachment of negatively charged bacteria. Following incubation, thePDLdrop was removed and discarded, after which the cantilever tip was gently washed with 100l of molecular biology-grade water. The sensors on the cantilever were then allowed to dry for at least 15min before use.

Spiking refers to the process of inoculating blood culture samples with artificially infected blood. Here, we cultured strains of interest on Columbia Agar plates ON at 37C, isolated a single colony, and resuspended it in 0.9% NaCl with volumes adjusted to obtain a 0.5 McFarland density. We then performed two 1:10 serial dilutions, starting with that suspension, to generate a final dilution of 1:100. Finally, 10l of the final dilution were added to 9990l of EDTA blood from a donor provided by a blood donation center in Switzerland. Blood has been received fully anonymized.

To generate spiked blood cultures, we added 10ml of artificially infected blood to either anaerobic (ANH) or aerobic (AEH) blood culture bottles (BD BACTECTM Lytic Anaerobic medium and BD BACTECTM Standard Aerobic medium Culture Vials; Becton Dickinson, Eysins, Switzerland) using a syringe. These culture bottles were then incubated until positivity, as determined by the BACTECTM 9240 automated blood culture system (Becton Dickinson), was reached. In most cases, this process took 12h or an overnight incubation.

To generate and purify bacterial pellets for nanomotion recordings, we used either the MBT Sepsityper IVD Kit (Bruker) or the direct attachment method (DA). When using the MBT Sepsityper IVD Kit, we followed the manufacturers instructions. Briefly, 1ml of blood culture was combined with 200l Lysis Buffer, mixed by vortexing, and then centrifuged for 2min at 12,000g to obtain a bacterial pellet. The supernatant was discarded, while the bacterial pellet was resuspended in 1ml of Washing Buffer. The resuspension was then centrifuged again for 1min at 12,000g to remove debris. For DA, 1ml of positive blood culture (PBC) was syringe filtered (5m pore size, Acrodisc Syringe Filters with Supor Membrane, Pall, Fribourg, Switzerland). The pellet was then used for attachment to the cantilever.

Bacterial cells from prepared pellets needed to be immobilized onto the surface of the functionalized cantilever for nanomotion recording. First, pellets were resuspended in a PBS (Phosphate Buffer Saline, Corning) solution containing 0.04% agarose. Next, the sensor was placed on a clean layer of Parafilm M (Amcor, Victoria, Australia). The tip of the sensor, containing the chip with the cantilever, was placed into contact with a single drop of bacterial cell suspension for 1min. After this, the sensor was removed, gently washed with PBS, and assessed using phase microscopy for attachment quality. In the event of unsatisfactory attachment, the sensor was re-incubated in the cell suspension for an additional 3060s, or until satisfactory attachment was achieved. We aimed for an even bacterial distribution across the sensor (Fig.1b, c, and Supplementary Fig.2). The attachment of bacteria is part of a filed patent (PCT/EP2020/087821).

Our nanomotion measurement platform, the Resistell Phenotech device (Resistell AG, Muttenz, Switzerland), comprises a stainless-steel head with a measurement fluid chamber, an active vibration damping system, acquisition and control electronics, and a computer terminal.

Nanomotion-based AST strategies utilize technologies that are well-established in atomic-force microscopy (AFM). Specifically, our nanomotion detection system is based on an AFM setup for cantilever-based optical deflection detection. However, in contrast to standard AFM devices, in the Phenotech device the light source and the photodetector are placed below the cantilever to facilitate the experimental workflow. A light beam, focused at the cantilever end, originates in a superluminescent diode (SLED) module (wavelength: 650mm, optical power: 2mW), is reflected, and reaches a four-sectional position-sensitive photodetector that is a part of a custom-made precision preamplifier (Resistell AG). The flexural deflection of the cantilever is transformed into an electrical signal, which is further processed by a custom-made dedicated electronic module (Resistell AG) and recorded using a data acquisition card (USB-6212; National Instruments, Austin, TX, USA). The device is controlled using a dedicated AST software (custom-made, Resistell AG).

The custom-made sensors used for the described experiments (Resistell AG) contain quartz-like tipless cantilevers with a gold coating acting as a mirror for the light beam (SD-qp-CONT-TL, spring constant: 0.1N/m, length width thickness: 130400.75m, resonant frequency in air: 32kHz; NanoWorld AG, Neuchtel, Switzerland). During an AST experiment, bacterial nanoscale movements actuate the cantilever to deflect in specific frequencies and amplitudes.

For the development of temperature-controlled experiments with CZA at 37C, we used modular NanoMotion Device (NMD) prototypes. It allowed the reconfiguration of the hardware setup to work with either a standard incubator or a modified measurement head to warm up only the measurement chamber. For the merge of an NMD with a BINDER BD 56 incubator, the size of the incubator fits the entire NMD head with the active vibration damping module, also permitting the user a comfortable manual operation. The incubator shelf was rigid and able to hold the vibration isolator and NMD head (ca. 10kg), and the incubator was modified with an access port to pass through control cables operating the light source, photodetector, and vibration damping module from the outside. Another NMD prototype was equipped with a locally-heated measurement chamber, thermally insulated from the measurement head set-up. A Peltier module as a heating element was installed under the measurement chamber, adapted to temperature control by adding a Pt100 temperature sensor. Temperature was kept at 37C by a Eurotherm EPC3016 PID controller (Eurotherm Ltd, Worthing, United Kingdom) and a custom-made Peltier module driver. Both setups had a temperature stability <0.2C, which is a matching requirement for stable culture conditions.

Each sampled nanomotion signal was split into 10s timeframes. For each timeframe, the linear trend was removed and the variance of the residue frame was estimated. For some experiments, the variance signal was too noisy for classification, necessitating the application of an additional smoothing procedure. A running median with a 1min time window was applied to smooth the variance signal and allow plot interpretation. For the calculation of the SP slope of the variance in the drug phase used for determining the nanomotion dose response in Fig.2b and Supplementary Fig.4, we used the formula log(x)=log(C) + at, where t is time (in min), a is the slope of the common logarithm of the variance trend, and log(C) is the intercept. Variance plots were used here for the visual inspection of results, and are currently the primary tool accessible for investigators. However, more sophisticated SPs are necessary for reliably classifying phenotypes in ASTs.

Nanomotion-based AST was performed using Resistell Phenotech devices (Resistell AG, Muttenz, Switzerland) on a standard laboratory benchtop. Each recording comprises two phases: a 2-h medium phase and a 2-h drug phase. In addition, a short blank phase is conducted to measure the baseline deflections of a new, bare, functionalized cantilever in 50% LB medium for 510min. Raw nanomotion recordings were used to develop classification models using machine learning.

The signal during the blank phase is expected to be constant and primarily flat (variance around 2.6 E-6 or lower). Higher median values or the clear presence of peaks are indicators of potential contamination of the culture medium inside the measurement fluid chamber, sensor manufacturing errors, or an unusual external environmental noise source that should be identified and rectified. In particular, contamination (OD600<0.01) can cause deflection signals that are several orders of magnitude higher than expected for sterile media due to interactions between floating particles in the fluid chamber and the laser beam. The blank phase serves as a quality control but is not used for classification models and, therefore, can be performed several hours prior to recording medium and drug phases.

The medium phase records cantilever deflections after bacterial attachment, showing the oscillations caused by natural bacterial nanomotions stemming from metabolic and cellular activity. Here, variance is expected to be greater (105 to 103) than during the blank phase. The 2-h medium phase duration allows cells to adapt to their new environment within the fluid chamber and generates a baseline that can be compared to bacterial nanomotions during the drug phase. The drug phase measures cellular vibrations after an antibiotic has been introduced to the fluid chamber. The antibiotic is directly pipetted into the medium already present within the measurement chamber.

The Phenotech device detects nanomotion signals resulting from the activity of living cells. However, other sources can create detectable noise during cantilever-based sensing61. Thermal drift occurring on the cantilever62, as well as external sources such as acoustic noise and mechanical vibrations, can all impact measurements. Distinguishing cell-generated vibrations from background noise can be challenging. As such, we employed a supervised machine learning-based approach to extract signal parameters (SPs) containing diagnostic information while minimizing overall background noise. The entire procedure of analyzing motional activity of particles is part of a filed patent (PCT/EP 2023/055596).

First, a batch of initial SPs related to frequency and time domains were extracted, with time and frequency resolution being high to allow for further statistical analysis at this level. Next, different statistical parameters were created with a much coarser time and frequency scale. Finally, various combinations (differences, ratios, etc.) were calculated, forming a final batch of SPs that are more related to antibiotic susceptibility. SPs were estimated for experiments with cells and conditions with well-defined and known outputs (e.g., susceptibility to a given antibiotic could be known through reference AST methods). Here, extracted SPs and outputs formed labeled datasets that could be used for supervised machine learning.

A feature selection algorithm extracted SPs related to the phenomenon of interest. These SPs were selected from the overall batch of SPs to optimize the performance of this so-called machine learning model. In this case, the model was a classifier validated by analysis of metrics measuring the degree of distinguishing antibiotic susceptibility. Therefore, a forward selection method was applied. All SPs were subsequently evaluated in the classifier with repeated stratified cross-validation. The SPs that enabled the classifier to reach maximal accuracy were added to the stack of selected SPs and deleted from the remaining SPs. In the next iteration, all remaining SPs were again tested with the already-selected SPs. The best-performing SP was again added to the selected SP stack. This process was repeated several times until the overall performance reached a plateau or a predefined number of SPs were selected. In the final model (iii), these newly found SPs were then used as machine learning model features. Classifier models were trained using the complete available dataset and could now be used to classify previously unseen data. The Supplementary information elaborates in more detail on that process and lists all SPs used in the different classification models.

After achieving Pareto optimality, the models were tested on independent test datasets consisting exclusively of strains of K. pneumoniae or E. coli that were not used in the training of the corresponding model. We used either spiked blood cultures or directly anonymized remnant PBC from the Lausanne University Hospital (CHUV) in Lausanne. Spiking was predominantly utilized to increase the fraction of resistant strains to obtain more representative specificity (classification performance of resistant strains), as resistance rates at that hospital are around 10 % for CRO and CIP and close to non-existent for CZA. Each nanomotion recording was classified separately and combined using the median to a sample reporting accuracy, sensitivity and specificity exactly as described for reporting the training performance.

In addition to this, we performed an interim analysis of the multicentric clinical performance study PHENOTECH-1 (NCT05613322), conducted in Switzerland (Lausanne University Hospital, Lausanne), Spain (University Hospital Ramn y Cajal, Madrid) and Austria (Medical University of Innsbruck, Innsbruck). The study evaluates the performance of the nanomotion AST with the Phenotech device using the CRO model on E. coli and K. pneumoniae from fresh residual PBC. Ethical review and approval were obtained by the hospital ethics committee at each participating site. In Lausanne and Innsbruck, only samples from patients who had previously agreed to the use of their residual biological material were utilized. In Madrid, consent for participation was not required for this study in accordance with institutional requirements. No compensation was paid to participants. The interim results reported here comprise the first included 85 samples with complete data entry. The eventual sample size of 250 was estimated based on the expected rate of E. coli and K. pneumoniae samples susceptible to the antibiotic in the three countries (i.e., 80%). Allowing for up to 10% samples with missing data or technical errors, an overall sample size of 250 would include 180 truly susceptible samples with 98% power to demonstrate that sensitivity is at least 90%. The PHENOTECH-1 study is expected to conclude in 2024. The endpoints of this study include the accuracy, sensitivity, and specificity of the device according to ISO-20776-2 (2021), as well as the time to result from the start of the AST to the generation of the result in form of a time stamped report. Regarding inclusion criteria, patients aged 18 years or older, with positive blood cultures for either E. coli or K. pneumoniae, are eligible for participation in the study. Additionally, Phenotech AST needs to be performed within 24h of the blood culture turning positive. Patients with polymicrobial samples are excluded from the study.

Qualitative results of the Kirby Bauer disk diffusion assay, i.e., either R or S, were used for benchmarking. Clinical breakpoints for the class definition were according to EUCAST in 2022. The samples coming from one PBC were measured in technical triplicates for 4h. The results from each recording were automatically combined to a sample. Instead of the median score, a majority voting system was in place that is, RRR, RRS and RR- return predicted resistance, SSS, SSR, SS- return susceptibility. In this way even if one recording needed to be excluded because of technical errors, or detection of substantial elongation of the specimen, the sample could be interpretated. Only if two or more recordings were excluded, or the exclusion of one recording resulted in the disagreement between the two remaining recordings, the sample would be classified as non-conclusive. The experiments were not randomized and the investigators were unblinded during experiments and outcome assessment. Information on sex, gender, and age of participants was not collected in this study as having no impact on the generalizability and translation of the findings. At the time of analysis, the data set included 119 samples, of which 12 screening failures, 5 with technical errors or elongation, and 12 incomplete/unverified. Samples with complete, verified and cleaned data accounted to 90. Of these, the first 85 samples were selected of which 20 samples derived from CHUV, 48 samples from Ramon y Cajal Hospital and 17 samples from Medical University of Innsbruck.

Statistical details can be found in the figure legends. Data are presented as mean or medianSD or representative single experiments and provided in the Source data file. In Figs.3, 5, and 6, the performance calculation is based on single recordings for which a score was calculated. Each recording is depicted as a datapoint representing a biological replicate originating from a different PBC. Performance calculation in Fig.4 is based on the median of the scores calculated for each technical replicate originating from the same PBC. Thus, each datapoint represents the median score as it is currently implemented in the PHENOTECH-1 clinical performance study. In each case, scores are logits predicted by the corresponding logistic regression model. In Fig.5e the two-tailed MannWhitney U test was performed for calculating a p-value. Statistical analysis and graphs were generated with GraphPad Prism 10.

Further information on research design is available in theNature Portfolio Reporting Summary linked to this article.

The rest is here:
Accurate and rapid antibiotic susceptibility testing using a machine learning-assisted nanomotion technology platform - Nature.com

AI reveals the complexity of a simple birdsong – The Washington Post

To a human ear the songs of all male zebra finches sound more or less the same. But faced with a chorus of this simple song, female finches can pick the performer who sings most beautifully.

Zebra finches are found in Australia, and they usually mate monogamously for life making this a high-stakes decision for the female finches. The zebra finch is among about a third of songbirds who learn a single song from their fathers early in life, and sing it over and over, raising the question of how female songbirds distinguish between them to choose a mate.

Listen to the song of a male zebra finch:

Scientists believe that most male songbirds evolved to sing a variety of songs to demonstrate their fitness. Under that theory, the fittest songbirds will have more time and energy to work on their vocal stylings and attract females with their varied vocal repertoire.

New research using machine learning shows finches may be sticking to one tune, but how they sing it makes a big difference. Published Wednesday in the journal Nature, the study reveals the complexity of a single zebra finch song and what female songbirds might be hearing in their prospective mates seemingly simple songs.

When researchers analyze birdsongs, theyre often not listening to them but rather looking at spectrograms, which are visualizations of audio files.

So I put together that, Hey, what humans are doing is looking at images of these audio files. Can we use machine learning and deep learning to do this? said Danyal Alam, the lead author on the new study and a postdoctoral researcher at the University of California at San Francisco.

Alam, along with Todd Roberts, an associate professor at UT Southwestern Medical Center and another colleague, used machine learning to analyze hundreds of thousands of zebra finch songs to figure out how they were different from each other and which variations were more attractive to female songbirds.

The researchers found one metric that seemed to get females attention: the spread of syllables in the song. The females seemed to prefer longer paths between syllables. This isnt something humans can easily pick up by listening to the songs or looking at the spectrograms but based on how these algorithms mapped the syllables, the researchers were able to see them in a new way.

To check their hypothesis, the researchers brought the findings back to the birds.

They generated synthetic bird songs to see if females preferred those with a longer path and they did, suggesting the birds intended audience picked up on the same pattern as the researchers computers.

Listen to see if you can tell the difference between a synthetic finch song that doesnt spread out its syllables:

Alam and his colleagues also found that baby birds had a harder time learning the long-distance song patterns than the shorter ones which suggests fitter birds would be more able to learn them, the researchers said.

The studys finding is consistent with whats been shown in other species: The more complexity or difficulty in a song, the more appealing it is to female birds.

A lot of signals in animal communication are meant to be an honest signal of some underlying quality, said Kate Snyder, a researcher at Vanderbilt who wasnt involved in the new paper.

For example, she said, if you look at a peacock, you see the male birds with the longer and more beautiful tails are better at attracting mates. Maintaining a tail like that is expensive for the bird so it must be good at finding food and surviving in its environment to have the time to devote to keeping its tail nice.

Learning takes a lot of time, energy, brain space, Snyder said. Only the fittest male birds will have the time and energy to devote to learning to sing.

Among finches, that work has just been harder to spot until now.

We used to think of this single song repertoire as perhaps a simple behavior, said Roberts. But what we see is that its perhaps much more complicated than we previously appreciated.

See more here:
AI reveals the complexity of a simple birdsong - The Washington Post

Undergraduate Researchers Help Unlock Lessons of Machine Learning and AI – College of Natural Sciences

Brain-Machine Interface

AI also intersects with language in other research areas. Nihita Sarma, a computer sciencethird-year student and member of Deans Scholars and Turing Scholars, researches theintersection of neuroscience and machine learning to understand language in the brain, workingwith Michael Mauk, professor of neuroscience, and Alexander Huth, an assistant professor ofcomputer science and neuroscience.

As research subjects listen to podcasts, they lie in an MRI machine and readings track their brainactivity. These customized-to-the-subject readings are then used to train machine learningmodels called encoding models, and Sarma then passes them through decoding models.

My research is taking those encodings and trying to backtrack and figure out based on thisneural representation based on the brain activity that was going on at that moment whatcould the person inside the MRI machine possibly have been thinking or listening to at thatmoment? Sarma said.

Along with gaining a better understanding of how language is represented in the brain, Sarmasaid the research has possible applications for a noninvasive communication tactic for peopleunable to speak or sign.

We would be able to decode what theyre thinking or what theyre trying to say, and allow themto communicate with the outside world, Sarma said.

Read more here:
Undergraduate Researchers Help Unlock Lessons of Machine Learning and AI - College of Natural Sciences

Machine Learning Accelerates the Simulation of Dynamical Fields – Eos

Editors Highlights are summaries of recent papers by AGUs journal editors. Source: Journal of Advances in Modeling Earth Systems

Accurately simulating and appropriately representing the aerosol-cloud-precipitation system poses significant challenges in weather and climate models. These challenges are particularly daunting due to knowledge gaps in crucial processes that occur at scales smaller than typical large-eddy simulation model grid sizes (e.g., 100 meters). Particle-resolved direct numerical simulation (PR-DNS) models offer a solution by resolving small-scale turbulent eddies and tracking individual particles. However, it requires extensive computational resources, limiting its use to small-domain simulations and limited number of physical processes.

Zhang et al. [2024] develop the PR-DNS surrogate models using the Fourier neural operator (FNO), which affords improved computational performance and accuracy. The new solver achieves a two orders of magnitude reduction in computational cost, especially for high-resolution simulations, and exhibits excellent generalization, allowing for different initial conditions and zero-shot super resolution without retraining. These findings highlight the FNO method as a promising tool to simulate complex fluid dynamics problems with high accuracy, computational efficiency, and generalization capabilities, enhancing our ability to model the aerosol-cloud-precipitation system and develop digital twins for similarly high-resolution measurements.

Citation: Zhang, T., Li, L., Lpez-Marrero, V., Lin, M., Liu, Y., Yang, F., et al. (2024). Emulator of PR-DNS: Accelerating dynamical fields with neural operators in particle-resolved direct numerical simulation. Journal of Advances in Modeling Earth Systems, 16, e2023MS003898. https://doi.org/10.1029/2023MS003898

Jiwen Fan, Editor, JAMES

Read more:
Machine Learning Accelerates the Simulation of Dynamical Fields - Eos

Inter hospital external validation of interpretable machine learning based triage score for the emergency department … – Nature.com

Study design and setting

This retrospective and validation study was executed across from 3 ED in Korea (A, B and C). A, B and C are tertiary hospitals located in a metropolitan city in Korea. Respectively, the hospital has approximately 2000, 1000, and 1000 inpatient beds. Approximately more than 80,000, 90,000 and 50,000 patients visit the ED annually. There are 16, 20 and 7 specialists working at each institution, respectively. All data were mapped to the Observational Medical Outcome Partnership Common Data Model (OMOP-CDM) for the multicenter study. This study was approved by the Samsung Medical Center Institutional Review Board (2023-02-036), and a waiver of informed consent was granted for EHR data collection and analysis because of the retrospective and de-identified nature of the data. All methods were performed in accordance with the relevant guidelines and regulations.

Initially, ED patients from 2016 to 2017 were included for each hospital. Patient older than 18 with disease patients were included. We also excluded patient with left without being seen or death on arrival/cardiopulmonary resuscitation patients. We split into two cohort: development (70%) cohort for training the interpretable ML model and test (30%) for evaluation from each hospital.

We extracted data from each hospitals electronic medical records system which all patient information was deidentified. Candidate input variables were considered with available features at the stage of ED triage including demographic characteristics such as age, gender, administrative variables including time of ED visit and clinical variables such as severity index, consciousness, and initial vital sign. Comorbidities were also obtained from hospital diagnosis records in the preceding 5years before patients emergency visit and compared for each hospital. They were extracted from International Statistical Classification of Diseases and Related Health Problems, Tenth Revision(ICD-10). The list and description of candidate predictors and comorbidities are given in the supplementary Tables6 and 7.

Emergency patients with semi-acute conditions typically undergo surgical procedure or are admitted to Intensive care unit (ICU) following emergency room treatment and given the imperative for patients to survive. Our primary outcome was 2-day mortality which was the target feature for analysis to build the interpretable ML model for each hospital.

For the multicenter study, we adopted OMOP CDM from the research network Observational Health Data Sciences and Informatics (OHDSI)28 for standardized structure and vocabularies to map emergency department data based on Systematized Nomenclature of MedicineClinical Terms (SNOMED-CT) and Logical Observation Identifiers Names and Codes (LOINC) as example shown Supplementary Fig.1. Extract, Transformation and Load (ETL) process was performed with structured query language. Each ED care and diagnosis related information was mapped into proper CDM tables as shown in Fig.2. For example, patient demographics and vital sign are mapped to Person and Measurement table, respectively. After transformation was completed into CDM format, all hospital can get the same structure and vocabularies, for executing same research query. All details of transformation and code are accessible on Gitgub29.

Table mapping for converting clinical to common data model tables. CDM: common data model; ED: Emergency department.

AutoScore Framework is a machine learning-based clinical score generator, consisting of six modules developed from Singapore12. Module 1 uses a random forest for ranking variables according to their importance. Module 2 transforms variables by categorizing continuous variables to improve interpretation with quantile information. Module 3 makes scores for each variable based on a logistic regression coefficient. Module 4 selects which variables could be included in the scoring model. In Module 5, clinical domain knowledge is incorporated to the score and cutoff points can be defined when categorizing continuous variables. Module 6 evaluates the performance of the score in a separate test dataset. The AutoScore framework provides a systematic and automated approach to develop score automatically, combining of advantage of machine learning for discriminating and the strength of logistic regression in its interpretability. For the overall score generation, We considered weighted average scores across all institutions. For each institutionsi, a weight({w}_{i})was formulated as ({w}_{i})=(left(sqrt{{(AUC}_{i})} times {N}_{i}^{3}right))/({sum }_{i=1}^{M}sqrt{{(AUC}_{i})} times {N}_{i}^{3}))100%where({N}_{i})was the sample size,({AUC}_{i}) was the AUC value obtained based on the validation set, andMwas the total number of institutions. Overall score was calculated with weighted score based on ({w}_{i}).

We defined our new novel framework CDM Autoscore for ED, combination of CDM based standardized format and autoscore based interpretable framework shown in Fig.3. The analysis and preparation code using CDM format was also shared on GitHub29.

Overall process of CDM Autoscore for ED. Each Institutions conducted Extract, Transformation and Load process for converting local data into CDM format. Algorithms from each of institution were derived using interpretable machine learning framework and validated inter-and intra- institutionally. EMR: Electronic medical records; ETL: Extract, transformation and Load; OMOP CDM: Observational Medical Outcome Partnership Common Data Model.

Categorical features were expressed as frequency and percentages and continuous features were expressed as means and standard deviations. Comparison tests for each hospital were performed with analysis of variance and chi-square tests at 5% significance levels. Standardized mean difference (SMD) was also calculated for comparing each hospital. Two types of validations for this study were conducted. First, we executed internal-institutional validation for each hospitals score. We also performed intra-institutional validation pair-wisely for the external validation. Area under the curve in the receiver operating characteristic (AUROC) and 95% confidence interval (CI) with 1000 times of bootstrap was reported. Other metrics including accuracy, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) were also reported. SMOTE was conducted for handling the imbalance problem. Twice of minority was oversampled and same number of majorities according to the number of minority was sampled with fixed seed number.

Read more:
Inter hospital external validation of interpretable machine learning based triage score for the emergency department ... - Nature.com