Objectives COVID-19 is the biggest pandemic of the 21st century. The disease can be influenced by various sociodemographic factors and can manifest as clinical, pulmonary and gastrointestinal symptoms. This study used an artificial neural network (ANN) model with important sociodemographic factors as well as clinical, pulmonary and gastrointestinal symptoms to screen patients for COVID-19. Patients themselves can screen for these symptoms at home.
Methods Data on all registered patients were extracted in autumn. The best ANN model was selected from different combinations of connections, some hidden layers and some neurons in each hidden layer. In this study, 70% of the data were used in the network training process and the remaining 30% were used to evaluate the function of the multilayer, feed-forward, back-propagation algorithm.
Results The sensitivity and specificity of the ANN model in diagnosing patients with COVID-19 were 94.5% and 17.4%. In order of priority, clinical symptoms, sociodemographic factors, pulmonary symptoms and gastrointestinal symptoms were important predictive factors for COVID-19 using the ANN model. Screening patients for COVID-19 using clinical symptoms and sociodemographic factors (80% importance) remains essential.
Conclusions Home monitoring of oxygen saturation and body temperature as well as old age and drug addiction can be helpful in self-screening symptoms of COVID-19 at home, thereby preventing unnecessary visits to medical centres and reducing burden on medical services.
This article is made freely available for use in accordance with BMJ’s website terms and conditions for the duration of the covid-19 pandemic or until otherwise determined by BMJ. You may use, download and print the article for any lawful, non-commercial purpose (including text and data mining) provided that all copyright notices and trade marks are retained.https://bmj.com/coronavirus/usage
Statistics from Altmetric.com
Most of the COVID-19 symptoms have been identified and assessed separately in previous studies.
The pattern of the relationship between different factors remains unknown for home screening of the disease.
Our screening method had high sensitivity and acceptable specificity, with approximately 95% of patients with COVID-19 identified by home monitoring.
COVID-19 was identified from a series of cases of an acute atypical respiratory disease in Wuhan, Hubei Province in the People’s Republic of China in December 2019.1 2 The COVID-19 pandemic has spread around the world, with 4.6 million deaths across almost 200 countries.2 3 According to WHO, there have been 255 324 963 confirmed cases of COVID-19, including 5 127 696 deaths, as of 17 November 2021. (for up-to-date figures, see https://who.sprinklr.com/).
Various sociodemographic factors and clinical, pulmonary and gastrointestinal symptoms can be seen in patients with COVID-19. Sociodemographic factors such as older age, disease duration, drug addiction and direct contact with patients can be predictive factors for COVID-19. In recent epidemiological studies, old age (usually older than 60 years old) has been indicated to be a high-risk factor for contracting severe COVID-19.2–4 The risk of severe COVID-19 and death can also escalate with drug addiction as it can cause inflammation and reduce the lungs’ ability to respond to infection.5 6 Moreover, this type of virus can spread by inhaling droplets of saliva from sneezing and coughing, or even by exhaling or touching droplets on hands and face, and therefore direct contact with patients and lack of social distancing can effectively spread the virus.
Clinical symptoms such as saturation, body temperature, dizziness, headache, myalgia, loss of consciousness, hyposmia, paresis, weakness and tremor could also affect the diagnosis of COVID-19.7 8 Pulmonary symptoms such as respiratory rate, cough, sore throat, chest pain, dyspnoea and respiratory distress are common symptoms that appear 2–14 days after exposure to the virus.9 Nausea, vomiting, diarrhoea and anorexia are known gastrointestinal symptoms that are also significant in screening for COVID-19.10
The lack of efficient screening methods has created many problems and limitations all over the world. Furthermore, the high transmission rate of COVID-19 revealed a deficiency of applicant screening systems more and more in this regard. Therefore, COVID-19 is affecting more and more people each day, along with the ability of the virus to mutate into various types. Since the pattern of the relationship between different factors remains unknown in this disease, it is necessary to use methods such as an artificial neural network (ANN), which determines the pattern between variables, while not having restrictive underlying assumptions such as normality. In this study, the ANN model was used to screen patients for COVID-19 using some critical sociodemographic factors and clinical, pulmonary and gastrointestinal symptoms, which can be screened by people themselves at home.
Most of the symptoms in the medical care monitoring centres were registered for suspected COVID-19 patients who had PCR tests. Data on all registered patients in South Khorasan Province were extracted from 20 May to 20 December 2020; this allowed us to assess common disease symptoms in spring, summer and autumn separately. After removing patients with missing data, PCR test results were available only for 29 (7 cases tested negative and 22 tested positive), 24 (12 cases tested negative and 12 tested positive) and 1612 (453 cases tested negative and 1159 tested positive) participants in spring, summer and autumn, respectively. The small sample size in spring and summer limited our study to people who had PCR tests in autumn. Data collected in a checklist included age, oxygen saturation (SpO2) by pulse oximetry, body temperature, disease duration, cigarette or drug addiction (0 no, 1 yes), respiratory rate (5–10, 10–14, 14–18, 18–22, 22–28 and >28), direct contact with the patient (0 no, 1 yes), and having any symptoms including cough, sore throat, chest pain, dyspnoea, respiratory distress, dizziness, headache, myalgia, loss of consciousness, hyposmia, paresis, weakness, tremor, nausea, vomiting, diarrhoea and anorexia, and a PCR test result (0 negative, 1 positive). The PCR test result (0 negative, 1 positive) was a response variable.
Multilayer, feed-forward, back-propagation, supervised neural networks are powerful neural networks used in prediction and modelling construction.11 We used these algorithms, and accordingly the best structure of the neural network was selected from different combinations of connection functions (hyperbolic tangent or sigmoid to connect the input and middle layer neurons, linear, hyperbolic tangent, or sigmoid functions to connect the middle and output layer neurons), number of hidden layers (one or two) and number of neurons in each hidden layer (between 2 and 10).11 Of the data, 70% were used in the network training process and the remaining were applied in model evaluation. We repeated each grid three times to avoid random assignment of weights and random correlations. We report the best results. Lastly, the importance of input variables in the diagnosis of patients was determined. All analyses were performed using the Statistical 100 Package for Social Sciences V.26.
Data on 1612 subjects (720 women, 892 men) were assessed. Only 10 subjects were the healthy staff. The mean±SD age of the participants was 58.83±19.84 years. Of the patients who had positive PCR, 115 died and 158 were hospitalised.
Figure 1A shows that the neural network with hyperbolic tangent and sigmoid functions that connect the hidden-input and hidden-output layers with two neurons in the middle layer had acceptable sensitivity (94.5%) and the highest specificity (17.4%). Increasing the number of neurons in the hidden layer and increasing the number of hidden layers improved the performance of the model (figure 1B, C). Therefore, the best ANN model had two hidden layers with five and three neurons and with a hyperbolic tangent and sigmoid functions that connect the hidden-input, hidden-hidden and hidden-output layers, respectively.
Clinical and sociodemographic factors had a nearly 80% importance in the prediction, whereas gastrointestinal and pulmonary symptoms showed minor importance in the prediction with 6% and 14.5%. Clinical symptoms such as body temperature and SpO2, as well as sociodemographic factors such as duration, age and drug addiction, played an essential role in the diagnosis of COVID-19. On the other hand, pulmonary symptoms such as respiratory distress, sore throat and respiratory rate, as well as gastrointestinal symptoms such as nausea and diarrhoea, had the highest importance.
Table 1 shows that respiratory rate lower than 18, direct contact with the patient, and symptoms such as sore throat, chest pain, dyspnoea, respiratory distress, myalgia, hyposmia, weakness, nausea, diarrhoea, anorexia, decreasing SpO2 and body temperature, as well as ageing and disease duration, increase the likelihood of diagnosis of COVID-19.
In order of priority, clinical symptoms, sociodemographic factors, pulmonary symptoms and gastrointestinal symptoms were important predictive factors for COVID-19. In addition, screening patients for COVID-19 using clinical and sociodemographic factors (80% importance) remains essential. Among the different clinical symptoms, body temperature and SpO2 were the most important predictors that can be measured at home by pulse oximetry and thermometer without visiting medical centres. Similar studies have reported that an SpO2 of less than 92 and a body temperature greater than 37.5°C were clinical factors that can identify patients with COVID-19.7 8
Age, drug addiction and disease duration were important sociodemographic factors that predict COVID-19. Studies have shown that susceptibility to this infection increases with increasing age.4 Overall, greater attention must be given to older people and they should be trained to observe social distancing and healthcare measures. The more time that passed from the infection, the more likely it is to detect patients with COVID-19. For people with addiction, having access to healthcare services can be more complex and so they attempt to obtain drugs in unhealthy ways.12 People with addictive disorders need easier accessibility to prescription drugs and healthcare services.12
In this study, a respiratory rate lower than 18 indicated pulmonary symptoms among people with COVID-19. Some studies have indicated that people with COVID-19 also had sore throats.9 Also, most patients with COVID-19 experience respiratory distress in the severe stage, leading to hospitalisation and intubation and admission to the intensive care unit.10 Patients may also experience acute gastrointestinal symptoms including nausea and diarrhoea. In a systematic review and meta-analysis, 15.47% of patients had at least one gastrointestinal symptom.13
The sensitivity and specificity of the ANN model in this study were 94.5% and 17.4%, respectively. In other words, about 95% of patients with COVID-19 were identified by home monitoring using this ANN model. A meta-analysis showed CT scan presented a pooled sensitivity of 95.48%.14 Therefore, the sensitivity of this ANN model and the CT scan in identifying patients with COVID-19 was approximately the same.
COVID-19 transmission and mutation rates are very high15 and early screening for COVID-19 is essential. Therefore, the advantage of having a high sensitivity is preventing the wider spread of the disease in the community, resulting in reduced cost of treatment, reduced work pressure among the medical staff in hospitals and medical centres, and reduced economic, social, cultural and political problems due to COVID-19.
ANN is a relatively novel technique used to predict complex non-linear relationships in epidemiology and public health. Home monitoring of SpO2 and body temperature as well as old age and drug addiction can be helpful when self-screening for COVID-19 at home, with the ANN having an accuracy of 94.5%.
Patient consent for publication
Code of ethics (IR.BUMS.REC.1400.251) was obtained from Birjand University of Medical Sciences Ethics Committee.
We would like to thank Dr Sayyed Gholamreza Mortazavi Moghaddam, Associate Professor of Pulmonary Diseases, for his cooperation during the study. We also would like to thank Mr Seyed Mohammad Jafari for his invaluable assistance in editing this manuscript.
Contributors EA contributed to planning, statistical analysis and interpretation of the results, writing and critical revisions of the manuscript. NR was responsible for study conception and design and contributed to planning, literature search, writing and manuscript editing. Both authors read and approved the final manuscript.
Funding This work was supported by the Birjand University of Medical Sciences Research Council (grant number 4787).
Competing interests None declared.
Provenance and peer review Not commissioned; internally peer reviewed.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.