Skip to main content
SearchLoginLogin or Signup

Review 4: "Prospective Study of Machine Learning for Identification of High-risk COVID-19 Patients"

Reviewers praised the models' effectiveness in identifying less severe cases but suggested improvements in language consistency, figure descriptions, and experimental clarity. They also recommended more precise explanations of technical details.

Published onOct 29, 2024
Review 4: "Prospective Study of Machine Learning for Identification of High-risk COVID-19 Patients"
1 of 2
key-enterThis Pub is a Review of
Prospective study of machine learning for identification of high-risk COVID-19 patients
Prospective study of machine learning for identification of high-risk COVID-19 patients
Description

The Coronavirus Disease 2019 (COVID-19) pandemic constituted a public health crisis with a devastating effect in terms of its death toll and effects on the world economy. Notably, machine learning methods have played a pivotal role in devising novel technological solutions designed to tackle challenges brought forth by this pandemic. In particular, tools for the rapid identification of high-risk COVID-19 patients have been developed to aid in the effective allocation of hospital resources and for containing the spread of the virus. A comprehensive validation of such intelligent technological approaches is needed to ascertain their clinical utility; importantly, it may help develop future strategies for efficient patient classification to be used in future viral outbreaks. Here we present a prospective study to evaluate the performance of state-of-the-art machine-learning models proposed in PloS one 16, e0257234 (2021), which we developed for the identification of high-risk COVID-19 patients across four identified clinical stages. The model relies on artificial neural networks trained with historical patient data from Mexico. To assess their predictive capabilities across the six, registered, epidemiological waves of COVID-19 infection in Mexico, we measure the accuracy within each wave without retraining the neural networks. We then compare their performance against neural networks trained with cumulative historical data up to the end of each wave. Our findings indicate that models trained using early historical data exhibit strong predictive capabilities, which allows us to accurately identify high-risk patients in subsequent epidemiological waves—under clearly varying vaccination, prevalent viral strain, and medical treatment conditions. These results show that artificial intelligence-based methods for patient classification can be robust throughout an extended period characterized by constantly evolving conditions, and represent a potentially powerful tool for tackling future pandemic events, particularly for clinical outcome prediction of individual patients.

RR\ID Evidence Scale rating by reviewer:

  • Potentially informative. The main claims made are not strongly justified by the methods and data, but may yield some insight. The results and conclusions of the study may resemble those from the hypothetical ideal study, but there is substantial room for doubt. Decision-makers should consider this evidence only with a thorough understanding of its weaknesses, alongside other evidence and theory. Decision-makers should not consider this actionable, unless the weaknesses are clearly understood and there is other theory and evidence to further support it.

***************************************

Review: Authors main claim is train models using early historical data can exhibit strong predictive capabilities.

Below are some line edits:

  • Data:

    • However, for the sake of effectiveness and accuracy, only those characteristics demonstrating sufficient predictive power were included in our analysis”: How do you determine the predictive power of the variables before taking into the final consideration. Did you evaluate them by any statistical test?

    • information. Category 1 includes diabetes”: Category and Stage is same?

      Category and group is same? Please keep the same jargons throughout.

    • Category 2 includes gender, age, state of birth, state of residence, and age.”: Age is mentioned twice here

    • This implies that when carrying out a prospective study for patients in EW-5, there is uncertainty as to the specific day when a patient transitions from one stage to another.”: Not clear. Please paraphrase these lines or give more explanation.

  • Neural Network:

    • Supervised machine learning provides computer algorithms with the ability to learn from a known dataset to identify features and generate predictions about the outcome given a specific set of features, not included in the learning stage.”: What Supervised machine learning model you run? Did you compare the model performance between the different models run?

    • In this context, by making use of the publicly available database of Mexican COVID-19 patients, reference [17] reported on an artificial neural network capable…”: Why artificial neural network was used? Why not traditional supervised machine learning was used in this study?

    • …classifying patients into two classes: a) patients who are more likely to recover than to die or b) patients who are more likely to die than to recover.”: Binary classes: mortality or survived, right?

    • EW-2 through EW-5, irrespective of whether they lie within or outside the date span for the training data. To ensure a fair comparison in all cases, the neural network architecture is in all cases identical to that reported in Ref. [17].”: It is not clear that why you trained neural networks with the data from EW 2 through EW5? Why did you not include EW1 and EW6? It is also not clear that whether they lie within or outside the date span?

  • Result:

    • As mentioned earlier, we have carried out our prospective study in two phases”: But this is based on historical data?

    • In figure 3 we plot the number of new confirmed patients plotted vs date throughout the date span covered by the database, as was also done in Fig, 1a.”: Please describe in detail.

    • “…84.5% the clinical outcome for patients who are tracked from Stage 2, i.e.”: How did you get this value? Not clearly understandable.

  • Figure 1a and 1b:

    • 1a: There are 2 peaks out of the 25th percentile. Please describe more regarding this results/figure.

    • Caption: “gray regions indicate inter-pandemic periods (IP)”: How do you define IP, but many died in this period.

    • Needs more description for both figures

  • Figure 2:

    • Patient Evolution Graph: Not clear too. Diagram needs sub categories like a,b,c

    • Prospective Study Diagram: Needs more descriptions. What these arrow mean?

    • Caption: “flow diagram in the bottom-left panel illustrates”: Instead denoted by alphabetical orders: a,b,c.

    • Figures need more description. Please describe all sub-categories of figures

  • Table 2:

    • Should be left justified.

  • Table 3:

    • Wave: EW-3 + Section: 4: Why is this 0%?

    • Wave: EW-6 + Section: 4: What does NA% mean?

  • Figure 3:

    • Describe the color difference of the Figures 3. Describe details about the Figure. Describe the gray color in the Figure.

  • Figure 4:

    • Prediction accuracy plotted, for each of stages 1-4 as labeled, vs the length of the training data date span, indicated in the horizontal axis as the most recent wave included”: Not clear.

    • The accuracy values are obtained upon applying our neural networks to testing data resulting from our patient protocol tracking protocol…”: Not clear. May be the problem of language.

  • References:

    • “[16] A. S. S. Rao and J. A. Vazquez, Infection Control &Hospital Epidemiology 41, 826 (2020)”: Is this a book?

Overall comments:

  1. Check the language: A lot of confusing jargon is used. In addition, please keep using the same jargon throughout the paper.

  2. More description of the figures. For example, Figure 1 could use more description. Also, how do you define inter-pandemic periods, but many died during this period? 

  3. What is the definition of inter-pandemic periods (IP)?

  4. How did you determine the predictive power of the variables before taking it into final consideration? Did you evaluate them using any statistical test?

  5. What supervised machine learning model was run? Did you compare the model performance between the different models run?

  6. What artificial network was used?

  7. Data is based on historical data, but you mention that this is a prospective 

  8. Whether there is a repository of coding.

Connections
1 of 3
Comments
2
?
Kenneth Cantu:

On a different note, if you’re exploring new possibilities in gaming, you might want to consider checking out a Free Download Minecraft APK to explore a world of creativity and adventure.

?
Alan Dewey:

A prospective study on machine learning for identifying high-risk COVID-19 patients uses algorithms to analyze patient data and predict severe outcomes. This approach helps with early intervention, better resource allocation, and personalized treatment plans. By refining models with real-time data, the study ensures continuous accuracy as new variants emerge. For additional resources, visit https://nsfas-online-application.co.za/