Quantum deep reinforcement learning for clinical decision support in oncology: application to adaptive radiotherapy | Scientific Reports – Nature.com
Quantum deep reinforcement learning
Quantum deep reinforcement learning (qDRL) is a novel action-value-based decision-making framework derived from QRL [23] and the deep Q-learning [10] framework. Like conventional RL [9,31], our qDRL-based CDSS framework comprises five main elements: a clinical AI agent, the ARTE, a radiation dose decision-making policy, a reward, and a q-value function. Here, the AI agent is a clinical decision-maker that learns to make dose decisions that achieve clinically desirable outcomes within the ARTE. Learning takes place through agent-environment interaction, which proceeds in sequence: the AI decides on a dose and executes it, and in response a patient (part of the ARTE) transitions from one state to the next. Each transition gives the AI feedback on its decision in terms of the RT outcome and the associated reward value. The goal of RL is for the AI to learn a decision-making policy that maximizes the reward in the long run. This long-run reward is defined by a q-value function, which assigns to every state-dose-decision pair a value obtained from the accumulation of rewards over time (the return).
Assuming the Markov property (i.e., the environment's response at time \(t + 1\) depends only on the state and dose decision at time \(t\)), the qDRL task can be described mathematically as a 5-tuple \((S, \left| D \right\rangle, TF, P, R)\), where \(S\) is a finite set of patient states, \(\left| D \right\rangle\) is a superimposed quantum state representing the finite set of eigen-dose decisions, \(TF: S \times D \to S^{\prime}\) is the transition function that maps the patient's state \(s_{t}\) and eigen-dose \(\left| d \right\rangle_{t}\) to the next state \(s_{t + 1}\), \(P_{LC|RP2}: S^{\prime} \to \left[ 0,1 \right]\) is the RT outcome estimator that assigns probability values \(p_{LC}\) and \(p_{RP2}\) to the state \(s_{t + 1}\), and \(R: \left[ 0,1 \right] \times \left[ 0,1 \right] \to \mathbb{R}\) is the reward function that assigns a reward \(r_{t + 1}\) to the state-decision pair \(\left( s_{t}, \left| d \right\rangle_{t} \right)\) based on the outcome probability estimates.
An eigen-dose \(\left| d \right\rangle\) is a physically performable decision that is selected via quantum methods from the superimposed quantum state \(\left| D \right\rangle\), which represents all possible eigen-doses simultaneously. In simple terms, \(\left| D \right\rangle\) is the collection of all possible dose options and \(\left| d \right\rangle\) is the one option that is selected once a decision is made. Selecting the dose decision \(\left| d \right\rangle\) is carried out in two steps: (1) amplifying the optimal eigen-dose \(\left| d \right\rangle^{*}\) from the superimposed state \(\left| D \right\rangle\) (i.e., \(\left| D \right\rangle^{\prime} = \widehat{Amp}_{\left| d \right\rangle^{*}} \left| D \right\rangle\)) and (2) measuring the amplified state (i.e., \(\left| d \right\rangle = \widehat{Measure}\left( \left| D^{\prime} \right\rangle \right)\)).
The optimal eigen-dose \(\left| d \right\rangle^{*}\) is obtained from the deep Q-net, which serves as the AI's memory. The deep Q-net, \(DQN: S \to \mathbb{R}^{d}\), is a neural network that takes the patient's state as input and outputs a q-value for each eigen-dose (\(\left\{ q_{\left| d \right\rangle} \right\}\)). The optimal dose is then selected following a greedy policy, in which the dose with the maximum q-value is chosen (i.e., \(\left| d \right\rangle^{*} = \underset{\left| d^{\prime} \right\rangle}{\operatorname{argmax}}\; q_{\left| d^{\prime} \right\rangle}\)). We applied a double Q-learning [32] algorithm to train the deep Q-net. A schematic of a training cycle is presented in Fig. 2, and additional technical details are presented in the Supplementary Material.
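The greedy selection and the double Q-learning target described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the q-value lists stand in for the outputs of an online and a target deep Q-net, and the names `greedy_dose`, `double_q_target`, and the discount factor `gamma` are assumptions introduced here.

```python
def greedy_dose(q_values):
    """Index of the eigen-dose with the largest q-value (greedy policy)."""
    return max(range(len(q_values)), key=q_values.__getitem__)


def double_q_target(reward, q_online_next, q_target_next, gamma=0.99, done=False):
    """Double Q-learning target for a single transition.

    The online network chooses the next dose; the target network
    supplies the value of that choice.
    """
    if done:
        return float(reward)
    best_next = greedy_dose(q_online_next)
    return float(reward + gamma * q_target_next[best_next])


# Hypothetical q-values for four eigen-doses in the next state.
q_online_next = [0.2, 1.5, 0.7, 1.1]
q_target_next = [0.3, 1.0, 0.9, 1.4]

chosen = greedy_dose(q_online_next)
target = double_q_target(1.0, q_online_next, q_target_next, gamma=0.9)
terminal_target = double_q_target(2.0, q_online_next, q_target_next, done=True)
```

Separating the network that selects the dose from the one that evaluates it is what distinguishes double Q-learning from the single-network target, which would use the maximum of `q_target_next` directly.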
We initially employed Grover's amplification procedure [33,34] for the decision selection mechanism. While Grover's procedure works on a quantum simulator, it fails to work correctly on a quantum computer. The quantum circuit depth of Grover's procedure (for 4 or more qubits) is much greater than the coherence length of the current quantum processor [35]. Whenever the quantum circuit length exceeds the coherence length, the quantum state is significantly affected by system noise and loses vital information. Therefore, we designed a quantum controller circuit that is shorter than the coherence length and is suitable for the task of decision selection. The merit of our design is its fixed length: because the length is the same for any number of qubits, the circuit is suitable for larger qubit systems, up to the limit permitted by the circuit width. Technical details regarding its implementation on a quantum processor are presented in the Supplementary Materials.
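To make the simulator behaviour concrete, the following is a small noise-free statevector sketch of Grover amplification. It is a textbook version written for illustration (the function name `grover_probabilities` and the choice of marked state are assumptions), and it says nothing about hardware noise, which is the reason the procedure fails on a real processor.

```python
import math


def grover_probabilities(n_qubits, marked, iterations):
    """Ideal (noise-free) statevector simulation of Grover amplification."""
    size = 2 ** n_qubits
    amps = [1.0 / math.sqrt(size)] * size          # uniform superposition
    for _ in range(iterations):
        amps[marked] = -amps[marked]               # oracle: phase-flip the marked state
        mean = sum(amps) / size
        amps = [2.0 * mean - a for a in amps]      # diffusion: inversion about the mean
    return [a * a for a in amps]                   # amplitudes stay real here


n_qubits = 5
marked = 0b10101
# Standard choice for the number of iterations: floor(pi/4 * sqrt(N)).
iterations = math.floor(math.pi / 4 * math.sqrt(2 ** n_qubits))
probs = grover_probabilities(n_qubits, marked, iterations)
p_one_iteration = grover_probabilities(n_qubits, marked, 1)[marked]
```

In this ideal setting the marked state dominates after the full set of iterations, while a single iteration only raises its probability modestly above the uniform value of 1/32. Every additional iteration adds circuit depth, which is where a limited coherence length becomes a problem.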
An example of a controller circuit is given in Fig. 5. Controller circuits use twice the number of qubits, \(2n\), which are divided into \(n\) control qubits and \(n\) main qubits. The optimal eigen-state obtained from the deep Q-net is created in the control register by selecting the appropriate pre-control gates. The control register is then entangled with the main register via controlled NOT (CNOT) gates, each connecting a control qubit in the control register to a target qubit in the main register. A CNOT gate flips its target qubit (here from \(\left| 1 \right\rangle\) to \(\left| 0 \right\rangle\)) only when the control qubit is in the \(\left| 1 \right\rangle\) state and performs no operation otherwise. Because all the main qubits are prepared in the \(\left| 0 \right\rangle\) state, we introduced reverse gates (\(n\) X-gates in parallel) that flip them to \(\left| 1 \right\rangle\) before the CNOT gates act. X-gates flip \(\left| 0 \right\rangle\) to \(\left| 1 \right\rangle\), and vice versa. The CNOT gates then flip every main qubit whose control is in the \(\left| 1 \right\rangle\) state, creating a state that is element-wise opposite to the marked state. Finally, another set of reverse gates is applied to the main register before the measurement is made.
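Because every gate in this description acts on computational basis states, the circuit can be traced classically with bit flips. The sketch below follows the gate order given above under ideal, noise-free assumptions; the function name `controller_select` and the bit-list representation are illustrative choices, not part of the original work.

```python
def controller_select(marked_bits):
    """Trace the controller circuit on basis states using classical bit flips."""
    n = len(marked_bits)
    control = [0] * n   # control register, prepared in |0...0>
    main = [0] * n      # main register, prepared in |0...0>

    # Pre-control gates: X on each control qubit whose marked bit is 1.
    control = [c ^ b for c, b in zip(control, marked_bits)]
    # First set of reverse gates: X on every main qubit (|0> -> |1>).
    main = [m ^ 1 for m in main]
    # CNOT gates: flip main qubit i only when control qubit i is 1.
    main = [m ^ c for m, c in zip(main, control)]
    opposite = list(main)   # element-wise opposite of the marked state
    # Second set of reverse gates, followed by measurement of the main register.
    main = [m ^ 1 for m in main]
    return opposite, main


opposite, measured = controller_select([1, 0, 1, 0, 1])
```

For the marked state 10101 the intermediate main register holds 01010 and the measured register returns 10101. The number of gate layers does not grow with the number of qubits, which is the fixed-length property the design relies on.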
Figure 5. Quantum controller circuit for a 5-qubit (32 bit) system. (a) Quantum controller circuit for the selection of the state \(\left| 10101 \right\rangle\). The probability distributions correspond to (b) a failed Grover's amplification procedure for one iteration run on the 5-qubit IBMQ Santiago quantum processor and (c) a successful quantum controller selection run on the 15-qubit IBMQ Melbourne quantum processor.
Another advantage of the controller circuit is a controllable level of uncertainty. The controller circuit has additional degrees of freedom that can set the level of uncertainty that might be needed to model a highly uncertain clinical situation. By replacing the CNOT gate with a more general \(CU3\left( \theta, \phi, \lambda \right)\) gate, we can control the level of additional stochasticity through the rotation angles \(\theta\), \(\phi\), and \(\lambda\), which correspond to angles on the Bloch sphere. The angles can either be fixed or, for additional control, changed with the training episode.
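The effect of the rotation angle on the selection can be illustrated with the single-qubit U3 matrix. The sketch below assumes the common Qiskit-style parameterization of U3 and computes how often the target qubit leaves its starting basis state when the control qubit is in \(\left| 1 \right\rangle\); the helper names are hypothetical.

```python
import cmath
import math


def u3_matrix(theta, phi, lam):
    """Single-qubit U3 gate in the common (Qiskit-style) parameterization."""
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    return [
        [c, -cmath.exp(1j * lam) * s],
        [cmath.exp(1j * phi) * s, cmath.exp(1j * (phi + lam)) * c],
    ]


def target_flip_probability(theta, phi=0.0, lam=0.0, target_start=1):
    """Probability that the target leaves |target_start> when the control is |1>."""
    u = u3_matrix(theta, phi, lam)
    # Column of U3 giving the amplitudes of U3 applied to |target_start>.
    column = [u[0][target_start], u[1][target_start]]
    return abs(column[1 - target_start]) ** 2


p_cnot_like = target_flip_probability(math.pi)     # acts like a CNOT up to a phase
p_half = target_flip_probability(math.pi / 2)      # target flips half of the time
p_identity = target_flip_probability(0.0)          # target is never flipped
```

Under this parameterization the flip probability is \(\sin^{2}(\theta/2)\), so only \(\theta\) changes the measured distribution of a single target qubit in the computational basis; \(\phi\) and \(\lambda\) affect relative phases.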
The patient's state in the ARTE is defined by 5 biological features: a cytokine (IP10), a PET imaging feature (GLSZM-ZSV), radiation doses (tumor gEUD and lung gEUD), and genetics (cxcr1-Rs2234671). Their descriptions are presented in Table 2. These 5 variables were selected from a multi-objective Bayesian Network study [13], which considered over 297 biological features and identified the best features for predicting the joint LC and RP2 RT outcomes.
The training data analyzed in this study are obtained from the University of Michigan study UMCC 2007.123 (NCI clinical trial NCT01190527) and the validation data analyzed in this study are obtained from the RTOG-0617 study (NCI clinical trial NCT00533949). Both trials were conducted in accordance with relevant guidelines and regulations and informed consent was obtained from all subjects and/or legal guardians. Details on training and validation datasets, and necessary model imputation carried out to accommodate the differences in the datasets are presented in the Supplementary Materials.
Deep Neural Networks (DNN) were applied as transition functions for IP10 and GLSZM-ZSV features. They were trained with a longitudinal (time-series) dataset, with the pre-irradiation patient state and corresponding radiation dose as input features and post-irradiation state as output. For lung and tumor gEUD, we utilized prior knowledge and applied a monotonic relationship for the transition function since we know that gEUD should increase with increasing radiation dose. We assumed that the change in gEUD is proportional to the dose fractionation and tissue radiosensitivity,
$$\frac{g\left( t_{n} \right) - g\left( t_{n - 1} \right)}{t_{n} - t_{n - 1}} \propto d_{n} \left( 1 + \frac{d_{n}}{\alpha / \beta} \right). \tag{1}$$
Here \(g\left( t_{n} \right)\) is the gEUD at time point \(t_{n}\), \(d_{n}\) is the radiation dose fractionation given during the \(n\)th time period, and the \(\alpha / \beta\) ratio is the radiosensitivity parameter, which differs between tissue types. Note that we first applied constrained training [42] to maintain monotonicity with a DNN model; however, the trend of gEUD over time was flatter than anticipated, so we opted for a process-driven approach in the final implementation. Technical details on the NNs and their training are presented in the Supplementary Material.
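A discrete update consistent with Eq. (1) can be written by turning the proportionality into an equality. In the sketch below the proportionality constant `k`, the time step `dt`, the function name `geud_step`, and all numerical values are illustrative assumptions; the excerpt does not give the constants used in the study.

```python
def geud_step(g_prev, dose_per_fraction, alpha_beta, dt=1.0, k=1.0):
    """One update of Eq. (1) with an assumed proportionality constant k.

    g(t_n) = g(t_{n-1}) + k * dt * d_n * (1 + d_n / (alpha/beta))
    """
    return g_prev + k * dt * dose_per_fraction * (1.0 + dose_per_fraction / alpha_beta)


g = 0.0
trajectory = []
for d in [2.0, 2.0, 3.0]:          # hypothetical dose fractionation per period
    g = geud_step(g, d, alpha_beta=10.0)
    trajectory.append(g)
```

For positive `k`, `dt`, dose, and \(\alpha / \beta\), each increment is positive and grows with the dose, so gEUD increases monotonically with dose by construction instead of relying on the network to learn that constraint.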
DNN classifiers were applied as the RT outcome estimators for the LC and RP2 treatment outcomes. They were trained with post-irradiation patient states as input and binary LC and RP2 outcomes as labels.
The RT outcome estimator must also satisfy a monotonicity condition: increasing the radiation dose must increase both the probability of local control and the probability of radiation-induced pneumonitis. To maintain this monotonic relationship, we used a generic logistic function,
$$p_{LC|RP2} = \frac{1}{1 + \exp \left( \frac{g\left( t_{6} \right) - \mu}{T} \right)}, \tag{2}$$
where \(g\left( t_{6} \right)\) is the gEUD at week 6, and \(\mu\) and \(T\) are two patient-specific parameters that are learned from training the DNN. Here, \(\mu\) and \(T\) are the outputs of two neural networks that are fed into the logistic function and tuned one after the other, leaving the other fixed. The training details are presented in the Supplementary Materials.
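The logistic form of Eq. (2) is easy to evaluate directly. The sketch below implements the equation as printed, with made-up values for \(\mu\) and \(T\); in the study these come from two neural networks. Note that with the sign convention as printed, the probability rises with gEUD only when \(T\) is negative, so a negative value is assumed here for illustration.

```python
import math


def outcome_probability(geud_week6, mu, T):
    """Eq. (2) as printed: 1 / (1 + exp((g(t_6) - mu) / T))."""
    return 1.0 / (1.0 + math.exp((geud_week6 - mu) / T))


# Hypothetical patient-specific parameters (not taken from the paper).
mu, T = 60.0, -5.0
p_low = outcome_probability(50.0, mu, T)
p_mid = outcome_probability(60.0, mu, T)
p_high = outcome_probability(70.0, mu, T)
```

The parameter \(\mu\) sets the gEUD at which the probability equals 0.5, and the magnitude of \(T\) sets how sharply the probability changes around that point.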
The task of the agent is to determine the optimal dose that maximizes \(p_{LC}\) while minimizing \(p_{RP2}\). Accordingly, we built a reward function on the base function \(P^{+} = P_{LC} \left( 1 - P_{RP2} \right)\), as shown in Fig. 6. The algebraic form is as follows,
$$R = \begin{cases} P^{+} + 10 & \text{if } 70\% < p_{LC} < 100\% \text{ and } 0\% < p_{RP2} < 17.2\% \\ P^{+} + 5 & \text{if } 50\% < p_{LC} < 70\% \text{ and } 17.2\% < p_{RP2} < 50\% \\ P^{+} - 1 & \text{if } 0\% < p_{LC} < 50\% \text{ and } 50\% < p_{RP2} < 100\% \end{cases} \tag{3}$$
Figure 6. Reward function for reinforcement learning. Contour plot of the reward function as a function of the probability of local control (\(P_{LC}\)) and of radiation-induced pneumonitis of grade 2 or higher (\(P_{RP2}\)). The area enclosed by the blue line corresponds to the clinically desirable outcome, i.e., \(P_{LC} > 70\%\) and \(P_{RP2} < 17.2\%\). Similarly, the area enclosed by the green lines corresponds to the computationally desirable outcome, i.e., \(P_{LC} > 50\%\) and \(P_{RP2} < 50\%\). Along with \(P_{LC} \times (1 - P_{RP2})\), the AI agent receives a +10 reward for achieving the clinically desirable outcome, +5 for achieving the computationally desirable outcome, and -1 when unable to achieve a desirable outcome.
Here the AI agent receives an additional 10 points for achieving the clinically desirable outcome (i.e., \(p_{LC} > 70\%\) and \(p_{RP2} < 17.2\%\)), 5 points for achieving the computationally desirable outcome (i.e., \(p_{LC} > 50\%\) and \(p_{RP2} < 50\%\)), and -1 point for failing to achieve a desirable outcome altogether. The negative point motivates the AI agent to search for the optimal dose as soon as possible.
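The reward can be sketched as a small function. This version follows the prose description, in which the clinically desirable region is checked first, the computationally desirable region second, and every remaining case receives the penalty; that reading is an assumption, since the bounds in Eq. (3) as printed do not cover every combination of \(p_{LC}\) and \(p_{RP2}\). The function name `reward` is illustrative.

```python
def reward(p_lc, p_rp2):
    """Reward built on the base function P+ = p_LC * (1 - p_RP2)."""
    base = p_lc * (1.0 - p_rp2)
    if p_lc > 0.70 and p_rp2 < 0.172:
        return base + 10.0   # clinically desirable outcome
    if p_lc > 0.50 and p_rp2 < 0.50:
        return base + 5.0    # computationally desirable outcome
    return base - 1.0        # no desirable outcome reached


r_clinical = reward(0.80, 0.10)
r_computational = reward(0.60, 0.30)
r_penalty = reward(0.40, 0.60)
```

The base term keeps the reward sensitive to improvements inside each region, while the step bonuses separate the regions by a margin larger than the base term can ever span.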
To compensate for the small number of data points, we employed WGAN-GP [43], which learns the underlying data distribution and generates additional data points. We generated 4000 additional data points for training the qDRL models. A larger training dataset helps the reinforcement learning algorithm represent the state space more accurately. The training details are presented in the Supplementary Material.