Recent Advancements of Computer Vision in Healthcare: A Systematic Review
IslamMahmudul1
NayanNasim Mahmud2
IslamAshraful1,2
SikderSankar2,3
RashelMasud Rana4
AlamMd Zahangir1,2
-
( Department of Computer Science and Engineering, Independent University, Bangladesh,
Dhaka 1229, Bangladesh {mahmud, ashraful, zahangir.alam}@iub.edu.bd)
-
( Center for Computational & Data Sciences, Independent University, Bangladesh, Dhaka
1229, Bangladesh smnoyan670@gmail.com, sankar11@louisiana.edu)
-
( Department of Mathematics, University of Louisiana at Lafayette, Louisiana 70503,
USA)
-
( Department of Mechatronics Engineering, University of Évora, Évora, Portugal mrashel@uevora.pt)
Copyright © The Institute of Electronics and Information Engineers(IEIE)
Keywords
Computer vision, Deep learning, Healthcare, Systematic review, Machine learning, State-of-the-art Techniques
1. Introduction
Computer vision (CV) is a branch of artificial intelligence that interprets and analyzes
visual data. CV extracts useful information from images and videos, similar to how
humans perceive and interact with their environment [1]. CV has become an essential technology in modern healthcare. It serves as a crucial
element across various domains from disease detection to surgical assistance. It educates
machines to ``see'' and comprehend the world like humans do. This study presents a
systematic review of recent literature and explores the expanding role, current state-of-the-art
techniques, and implementation strategies of CV in healthcare.
The growing role of CV in healthcare includes the increasing use of cutting-edge techniques
like Convolutional Neural Networks (CNNs) and Long Short-term Memory (LSTM) in various
areas, highlights the importance of a detailed examination. The advancements in the
application of CV to healthcare and the potential of CV to re-shape healthcare have
motivated us to go for this study. CV has the ability to automate complex processes,
decrease human error, and improve diagnostic accuracy. It offers a scalable response
to the rising need for accurate and effective healthcare services by enabling computers
to evaluate and interpret visual data [2].
CV combines several techniques to analyze and make meaning of pictures, including
image recognition, pattern recognition, and Machine Learning [3]. DL techniques improve biomedical image classification with AutoML, combining transfer
and semi-supervised learning for accuracy and speed in small datasets [4]. Significant improvements in image analysis have been made by DL. DL makes it possible
to build artificial neural networks that can learn from enormous volumes of data [5]. More than 30 medical disorders can be tentatively identified using CV techniques
based on automatically recognized symptoms [6]. CV has the ability to diagnose illnesses such as pneumonia and diabetic retinopathy
[7]. CV aids in the real-time tracking and identification of surgical instruments during
operations [8]. In this study, we have found five domains of healthcare where CV has shown promising
results. The domains are human identity decoding [9], surgical assistance, drug discovery, remote patient monitoring [10], and disease detection [7].
Human Identity decoding: CV technology improves the accuracy of human identification.
It recognizes and categorizes different cell types in cellular biology which facilitates
quicker and more precise diagnoses [11]. It helps in managing patient data which ensures that the patient receives the appropriate
therapy.
Surgical Assistance: CNN has become significant in aiding surgical precision. CV based
surgical assistance improves navigation and instrument control during minimally invasive
operations. Technologies like the da Vinci Surgical System is used to make surgeries
safer and more accurate [12].
Drug Discovery: The use of CV in drug development has shown good efficiency. For instance,
CV can now forecast the therapeutic applications of specific medications based on
cell morphological changes. This procedure is labor-intensive and prone to mistakes
when carried out manually [13].
Remote Patient Monitoring: CV revolutionizes remote patient monitoring. It enables
real-time supervision and enhances patient engagement in healthcare. In remote patient
monitoring, CV technologies are more frequently utilized to identify minor alterations
in patients’ physical and physiological states. It enables more responsive and individualized
treatment [14].
Disease Detection: CV has emerged as an important tool for enhancing the accuracy
of disease detection. CV makes early diagnosis and intervention possible. The development
of Machine Learning (ML) based CV algorithms has allowed the accurate diagnosis of
conditions, including breast cancer, skin cancer, and diabetic retinopathy [15].
Fig. 1. Bibliometric analysis.
This study aims to map recent advancements in CV techniques within healthcare to improve
healthcare outcomes. Therefore, we have performed a bibliometric analysis of publications
to evaluate the development of CV in healthcare. Bibliometric analysis is a quantitative
approach for assessing the impact of research by analyzing publication patterns and
other metrics [16]. Fig. 1 shows a graphical representation of the bibliometric analysis which is a network
visualization of keywords related to CV and healthcare domain. A dataset of 451 peer-reviewed
publications published between 2020 and 2023 was obtained from PubMed. Larger nodes
in the network represent keywords that are more dominant in the analyzed documents.
The connecting lines between the nodes suggest a relationship within the literature
where these terms appear together. Different colors represent different categories.
The terms ‘human’, ‘healthcare’, ‘machine learning’, ‘computer vision’, and ‘deep
learning’ in the central of the network denotes their importance in this study. The
proximity of terms ‘medical imaging’, ‘image processing’ and ‘MRI’ to ‘deep learning’
and ‘computer vision’ suggests a strong relationship between these fields. It indicates
that deep learning and computer vision techniques are frequently used to medical imaging
and MRI data analysis.
Additionally, we have investigated the following research questions considering the
recent advancements of CV in healthcare.
RQ1: What types of CV techniques have been investigated for various healthcare applications?
RQ2: What techniques have been developed to enhance the interpretability of CV models
in healthcare applications?
RQ3: In what ways is the integration of CV and sensors optimized to deliver real-time
feedback across various healthcare applications?
RQ4: What are the most effective strategies to establish standardized performance
metrics and evaluation protocols for CV models in healthcare applications?
RQ5: What ethical guidelines and data governance principles are essential to ensure
responsible deployment of CV techniques in healthcare?
RQ6: What approaches have been explored to leverage CV for predictive analytics and
personalized treatment planning?
In the next sections, we will describe the methods and detailed findings of this study.
2. Systematic Review
We conducted this study to understand CV techniques better and promote more innovative
and focused CV applications in healthcare. CV presents new ways to analyze and interpret
intricate visual data. In this study, a total of 20 research papers spanning over
four years (2020-2023) have been examined and extracted from different databases including
PubMed, IEEE Xplore, Science Direct, Web of Science Core Collection, and Google Scholar.
Our review process followed Preferred Reporting Items for Systematic Review and Meta-
Analysis Protocols (PRISMA-P). Fig. 2 shows the PRISMA diagram of this research study.
Fig. 2. PRISMA diagram for this study.
2.1 Search Strategy & Inclusion Exclusion Criteria
We extracted only the papers which are relevant to this research study. Therefore,
we crafted our searches carefully. Rather than just typing in primary keywords, we
mixed and matched terms that fit exactly what we were looking for. For example, instead
of just looking up "computer vision in healthcare" we dialed in our search to be more
detailed. The following search string enabled the search engines of the databases
to locate and display the articles that align with our intended ones. The general
search string for all the databases was:
[("Computer Vision" AND "Healthcare") AND ("Drug Discovery" OR "Disease Detection"
OR "Remote Patient Monitoring" OR "Cancer Detection" OR "Cell Identification" OR "Surgery
Assistance" OR "Human Identity Decoding") NOT ("Surveillance" OR "Robotics" OR "Manufacturing"
OR "Agriculture" OR "Transportation")].
Prior to constructing our search string, we identified five domains from previous
literature [1,2,5,6, 11-15] as significant areas where CV techniques have shown potential for transformative
impact in healthcare. Our goal was to focus our review on domains where CV techniques
could contribute most significantly to advancements in healthcare outcomes. By predefining
these domains in our search string, we aimed to ensure a targeted and relevant exploration
of the literature. The exclusion of domains such as surveillance, robotics, manufacturing,
agriculture, and transportation from our search string is determined by our preliminary
literature study. The excluded domains relate to industrial, agricultural, or security
applications of CV technologies which differ from healthcare domains. Our review aimed
to concentrate on areas where CV directly influences healthcare such as patient care,
medical diagnostics, and therapeutic interventions. Therefore, we excluded these domains.
The construction of our search string and the selection of included and excluded domains
were the result of careful consideration aimed at maximizing the relevance of our
systematic review.
The inclusion criteria refer to the specific conditions that a study needs to meet
to be part of the review. On the other hand, the exclusion criteria list the conditions
that disqualify a study from the review [17]. We have followed specific rules regarding the studies included in our review. Table 1 presents inclusion and exclusion criteria for this study.
Table 1. Inclusion and exclusion criteria.
Inclusion Criteria
|
Exclusion Criteria
|
Searched Keywords: Computer Vision, Healthcare, Disease Detection, Medical Imaging
|
Searched Keywords: Statistical Method, Pharmacological Treatment,
Infectious Conditions, Physical Examination
|
Interest Area: Surgical Assistance, Cancer Detection, Remote Patient Monitoring.
|
Interest Area: Surveillance, Robotics.
|
Article Type: Research Article
|
Article Type: Thesis book, conference abstracts, opinion pieces.
|
Duration: Last four years (2020-2023)
|
Duration: Before 2020
|
Language: Studies Published in English
|
Language: Studies are not available in English
|
2.2 Data Extraction and Analysis
Data collection is a systematic process of gathering and measuring information on
variables for research or analysis. It ensures accurate and relevant information for
decision- making and evaluation [26]. For our review, we carefully examined articles in each stages and selected relevant
studies. As summarized in Table 2, this careful curation from multiple sources has assisted us in identifying gaps
in knowledge and important contributions in the field. We started with 125 papers
from our search strategy. It covers various aspects of healthcare.
Each paper was examined focusing on titles, abstracts, and key findings. Based on
the relevance, and methodological soundness we narrowed our list to 20 papers for
a deeper analysis. Fig. 3 depicts the number of studies selected for this study where CV techniques are successfully
applied in healthcare domains. After analyzing the data, we have found five major
domains such as drug discovery, disease detection, remote patient monitoring, surgical
assistance, and human identity identification. Most studies utilized CNN for their
investigations and surgical assistance was found as the predominant area where CV
techniques were applied.
After data analysis, we have found eight particular techniques have gained attention
in the medical field because of their effectiveness in handling different medical
imaging problems. These techniques include CNN, AdaBoost, Convolutional LSTM (ConvLSTM),
EfficientNetB0, LSTM, Mask R-CNN, Lucas-Kanade, and VGG [18-25]. Fig. 4 shows number of articles reviewed in this study by publication year.
The horizontal bar diagram in Fig. 4 shows the number of articles published from 2020 to 2023 selected for this study.
It shows that 2020 had the highest publication with 6 articles, followed by 5 articles
each in 2021 and 2022. 4 articles have been selected from 2023 for this study.
Fig. 3. CV techniques applied in healthcare domains.
Fig. 4. Number of articles by publication year.
Table 2. Number of articles selected at different stages of this study following PRISMA protocol.
Publisher Name
|
Identification stage
|
Screening stage
|
Inclusion stage
|
Springer
|
21
|
17
|
6
|
IEEE
|
17
|
14
|
5
|
Nature
|
14
|
8
|
3
|
ACS Publications
|
8
|
6
|
1
|
AMIA
|
6
|
4
|
1
|
MDPI
|
8
|
5
|
1
|
BMC
|
16
|
7
|
1
|
Taylor & Francis
|
4
|
1
|
1
|
Elsevier
|
6
|
3
|
1
|
Total
|
100
|
65
|
20
|
3. Results
In this section, we present the results derived from the examination of the twenty
papers published between 2020 and 2023. The exploration led to the identification
of five areas in healthcare where the use of CV is proving significant. These areas
comprise Disease Detection, Remote Patient Monitoring, Drug Discovery, Human Identity
Decoding, and Surgery Assistance. From the analysis of obtained data, it shows that
the most recurrent application of CV is in the area of Surgery Assistance. These findings
highlight the transformative potential of CV in optimizing surgical procedures, enhancing
precision, and, consequently, improving patient outcomes. A detailed summary of the
selected studies is presented in Table 3. The discussion will provide a detailed analysis of the findings. It highlights the
implications of applications within the mentioned healthcare sectors.
Table 3. Summary of the selected studies.
Source
|
Year
|
Objective
|
Method
|
Findings
|
[27]
|
2022
|
To detect skin cancer in the early stage.
|
NetB0 Algorithm.
|
DL Model excels in skin cancer detection with a 0.909 area under curve.
|
[28]
|
2022
|
To detect skin cancer.
|
CNN
|
Proposed approach achieved 83.2% accuracy in skin cancer detection.
|
[29]
|
2020
|
To identify and diagnose breast cancer at an early stage .
|
AdaBoost algorithm.
|
The applied method enhances breast cancer accuracy detection.
|
[30]
|
2022
|
To quantify architectural stripes in chromatin.
|
Stripenn algorithm.
|
The applied method can identify and quantify architectural stripes in chromatin conformation
data.
|
[31]
|
2020
|
To differentiate black legged ticks from other tick species.
|
CNN.
|
CNN model achieves 92% accuracy in predicting Lyme disease.
|
[32]
|
2020
|
To develop an assistive diagnostic tool for brain tumor detection.
|
Mask R-CNN.
|
90% accuracy has been achieved using Mask R-CNN.
|
[33]
|
2023
|
To detect and characterize white blood and red blood cells in peripheral blood smears.
|
Haemorasis computational
Method.
|
Morphometric regression demonstrated high predictive performance with 89.7% AUC.
|
[34]
|
2022
|
To develop computational methods for protein binding site alignment.
|
ph4-strict, rules,
Open3D, ProCare.
|
This study finds 95% statistical score using ProCare.
|
[35]
|
2020
|
To increase patient safety by resolving prescription mistakes.
|
DL drug identification (DLDI) model.
|
DLDI model identified the prescribed drugs with more than 90% accuracy from blister
packaging.
|
[36]
|
2023
|
To develop an automated face identification system for patients.
|
DeepFace technology combines SSD Caffe and 19-layer VGG models.
|
DeepFace detects faces and retrieves EHRs within seconds with 97.35% accuracy on the
LFW dataset.
|
[37]
|
2021
|
To monitor physiological condition remotely.
|
rPPGNet.
|
The rPPGNet achieved state of the art in RePSS Challenge with MIBI at 117.25 and RHR
at 0.62.
|
[38]
|
2023
|
To develop a low-cost solution for monitoring cardiac patients using smartphone cameras.
|
Lucas-Kanade template tracking.
|
Estimated heart rates in this study closely matched standard measurements with a slight
bias of 0.649 beats/min.
|
[39]
|
2023
|
To develop a CV based systemfor the patients of Parkinsonian bradykinesia.
|
Unified Parkinson's Disease Rating Scale (UPDRS).
|
The proposed approach achieved 86% accuracy in all items.
|
[40]
|
2020
|
To monitor patients' various medical conditions such as body temperature, heart rate
remotely.
|
An IoT based system
with wearable sensors.
|
The developed system can effectively monitor patients' vital signals remotely.
|
[41]
|
2021
|
To enhance surgeons' capabilities by controlling operating hands.
|
CNN
|
CNN achieved best performance with an mAP of 75.4 on the selected dataset.
|
[42]
|
2020
|
To evaluate suturing performance in surgery .
|
Suture analysis
Software (SAS) and SMOF.
|
The system can effectively evaluate surgical suturing skills for optimal wound healing.
|
[43]
|
2021
|
To verify surgical vision algorithms.
|
VisionBlender.
|
VisionBlender can verify algorithms and generate datasets.
|
[44]
|
2021
|
To identify and classify suturing gestures .
|
convLSTM.
|
convLSTM identifies and classifies suturing gestures with an AUC of 0.88 and 0.87
|
[45]
|
2021
|
To analyze upper eyelid surgery.
|
DLIB ML toolkit.
|
The postoperative changes in group two were greater than group one.
|
[46]
|
2022
|
To analyze and Validate EndoDigest.
|
CNN.
|
CNN predicted cystic duct division with 75% video documentation.
|
3.1 RQ1: What Types of CV Techniques have been Investigated for Various Healthcare
Applications?
DL techniques specifically CNNs have emerged as effective solutions for diverse healthcare
applications. They increase the accuracy of diagnosis and treatment planning across
various healthcare applications. For instance, the EfficientNet-B0 Algorithm is a
type of CNN that effectively detects skin cancer with an Area Under the Curve (AUC)
of 0.909 [27] while CNNs achieve an accuracy of 83.2\% in similar tasks [28]. Among twenty studies reviewed, two specifically experimented with skin cancer detection
using DL techniques though they measured outcomes in two different ways. One study
highlighted the enhanced accuracy in breast cancer detection using the AdaBoost algorithm
[29]. Apart from skin cancer detection, CNNs have been applied to different types of healthcare
applications such as predicting Lyme disease risk with 92\% accuracy [31], developing assistive tools for diagnosing brain tumors with 90\% accuracy [32], assisting in surgery [41,46], and enabling remote patient monitoring [37]. Beyond CNNs, there are other techniques being utilized in various healthcare applications
such as LSTM, AdaBoost, VGG, and the Stripenn algorithm. One study reported that the
convLSTM model effectively classified suturing gestures with an AUC of 0.88 [44] while VGG models identified faces with 97.35\% accuracy [36]. The effective application of DL involves systematic steps. Comprehensive datasets
contributed to the success of drug identification and face retrieval [35]. Transfer learning [41,44] optimized scenarios with limited data, while regularization countered overfitting
[38]. Model interpretability was advanced [42] to develop clinical trust and collaboration with experts [42,46] to ensure clinical alignment. Although the impact of DL techniques on various healthcare
applications is proven through systematic strategies mentioned across studies but
challenges persist in CV systems’ seamless integration with existing healthcare systems.
3.2 What Techniques have been Developed to Enhance the Interpretability of CV Models
in Healthcare Applications?
Enhancing the interpretability of CV models in healthcare applications is significant
for clinician trust and model adoption. Various techniques have been developed to
address this issue which focuses on both global and local interpretability. Self-attention
mechanisms [37] enable CV models to highlight significant image regions for better understanding.
Additionally, model-specific techniques interpret predictions generated by a specific
model such as a neural network. Visualization of feature activations [44] demonstrates learned representations and links them to medical characteristics. Another
category is model-agnostic techniques which provide easy-to-understand explanations
of predictions. Integrating domain knowledge [37] augments interpretability by aligning predictions with medical insights. Techniques
like Grad-CAM and SHAP [27,38] offer visual explanations that clarify model decision rationales. These strategies
facilitate transparent and clinically relevant CV applications. Although integrating
interpretable models and self-attention mechanisms has led to higher accuracy but
there is a need for models that can generalize well across various medical conditions
to ensure consistent interpretability which still remains a challenge. Moreover, the
ability to operate in real-time and at scale is also a challenge.
3.3 RQ3: In what Ways is the Integration of CV and Sensors Optimized to Deliver Real-time
Feedback Across Various Healthcare Applications?
The integration of CV and sensors in healthcare applications has been optimized to
deliver real-time feedback across various domains such as patient monitoring, drug
discovery, and surgical assistance. One study showed that integrating SAS software
with fiber optic strain sensors enables immediate evaluation of suturing skills [42]. It enhances precision and provides actionable insights for skill refinement. This
integration offers real-time assessment of surgical techniques and contributes to
quantifiable proficiency metrics. Also, it contributes to minimally invasive surgery
scenarios by leveraging ML algorithms to analyze sensor data and visual inputs. The
integrated approach could enable prompt recognition of effective surgical maneuvers
which promote better decision-making. The evolution of algorithms and synchronization
mechanisms has the potential to revolutionize surgical training which increases patient
safety [44]. For remote patient monitoring, incorporating both CV and physiological sensors has
led to continuous health monitoring and real-time feedback. For example, a wearable
sensor-integrated Internet of Things (IoT)-based system has shown notable success
in effectively monitoring patients remotely [40]. This real-time feedback has improved the effectiveness of interventions through
immediate adjustments based on sensor data. In drug discovery, the Deep Learning Drug
Identification (DLDI) model identified prescribed drugs with more than 90\% accuracy
from blister packaging [35] which shows the impact of CV and sensor integration on real-time feedback. Despite
these advancements, the collection and transmission of patient health data have faced
privacy and security challenges.
3.4 RQ4: What are the Most Effective Strategies to Establish Standardized Performance
Metrics and Evaluation Protocols for CV Models in Healthcare Applications?
Establishing standardized performance metrics and evaluation protocols for CV models
in healthcare applications is crucial for achieving consistency and reliability. A
structured framework is essential but various studies have utilized different types
of metrics for evaluation due to the examination of different datasets. Accuracy and
AUC are the two most commonly used performance metrics across the studies. For instance,
out of twenty studies reviewed, eight selected accuracy as the performance metric
while three went for AUC. Strategies derived from different research studies focus
on effective approaches. One study [44] shows the significance of disease-specific metrics by achieving notable classification
and identification performance using AUC. Another study [36] reports the importance of domain-specific evaluation by attaining a 97.35\% accuracy
rate in face detection. The integration of explainable AI techniques as demonstrated
in these studies [37,44] not only enhances transparency but also helps in the evaluation process. Collaborative
efforts with clinicians and domain experts are important to ensure clinically meaningful
evaluation criteria. Additionally, benchmark datasets as mentioned in one study [44], provide a foundation for standardized assessment. The adoption of standardized performance
metrics and evaluation protocols will enhance the accuracy of CV models. However,
to ensure the robustness of CV models more diverse datasets can be produced and tested
by CV models.
3.5 RQ5: What Ethical Guidelines and Data Governance Principles are Essential to Ensure
Responsible Deployment of CV Techniques in Healthcare?
The responsible deployment of CV techniques in healthcare requires adherence to ethical
guidelines and data governance principles. Ensuring the privacy of medical data is
a challenge. During the experiment informed consent is obtained from patients but
maintaining privacy becomes complex when various stakeholders are involved during
experiments. Ethical guidelines and data governance have been emphasized across different
studies. One study highlights privacy-preserving techniques, particularly in sensitive
applications like human face recognition [36]. Other research has explored blockchain-based patient data control where data anonymization
and de-identification have been performed to mitigate risks associated with patient
data exposure [43]. The application of fairness principles can help to mitigate biases while interdisciplinary
collaboration and regulatory involvement are crucial for developing a comprehensive
ethical framework [45]. Transparency and accountability are important for justifying model decisions and
addressing unexpected outcomes. As the AI techniques are evolving rapidly, it is crucial
that existing ethical guidelines and governance frameworks be continuously updated
to tackle new challenges. Overall, the integration of CV in healthcare requires a
balanced approach that prioritizes patient privacy.
3.6 What Approaches have been Explored to Leverage CV for Predictive Analytics and
Personalized Treatment Planning?
Leveraging CV for predictive analytics and personalized treatment planning in healthcare
includes various methods as highlighted by different studies. In predictive analytics,
out of twenty studies reviewed, eight studies have utilized CNN models where the highest
testing accuracy was 92\% in predicting Lyme disease risk [31]. Other approaches, such as convLSTM and the AdaBoost algorithm have also been explored
for predictive analytics. The convLSTM model effectively classified suturing gestures
with an AUC of 0.88 [44] while AdaBoost showed enhanced accuracy in breast cancer prediction. The integration
of CV with predictive analytics enables more precise treatment planning, especially
in complex cases like cardiac care which is evidenced by a study that reported monitoring
cardiac patients using a smartphone camera [38]. Various CV-based works have been reported for personalized treatment including the
development of a CV-based system for Parkinson’s patients [39], and CNN and SAS aided systems to improve personalized surgery treatment [41,42]. Motion analysis has increased the accuracy of these aided systems. The integration
of visual and sensor-generated data offers opportunities for dynamic personalized
interventions [43]. DL models enhance breast cancer detection when combined with other patient data
such as radiomics and genomics which offers a holistic approach to personalized treatment
planning [29]. Collaborative interdisciplinary efforts ensure comprehensive personalized approaches
although challenges remain in seamlessly integrating these advanced CV techniques
into existing clinical workflows. Overall, these findings highlight the significant
potential of leveraging CV for predictive analytics and personalized treatment planning
in healthcare
5. Conclusion
The integration of CV and DL into healthcare is progressive. Motivated by the immense
potential of CV, we conducted this study. To perform a systematic review, we followed
the PRISMA guidelines. We have extracted the most recent studies from the last four
years, from 2020 to 2023. Initially, 125 papers were selected for this study. After
screening, we have found 20 relevant papers for final review study. CV amplified by
CNNs has broad applications such as early disease detection, precise surgical procedures,
remote patient monitoring, quick drug discovery and identity verification. Medical
surgery has been particularly transformed by CV technology. DL models demonstrate
their adaptability and accuracy across different medical imaging tasks in various
studies. The integration of CV and sensor technologies offers real-time insights into
surgical skills assessment. Moreover, CV’s role in predictive analytics and personalized
treatment planning shows its capacity to enhance patient-specific care.
Standardized performance metrics, ethical considerations, and data governance principles
are crucial for responsible model deployment. This brief review study provides valuable
insights but there are some limitations. Limitations include potentially narrow scope
since only 20 studies have been examined. Therefore, this study might be missing potential
healthcare domains where CV can be applied beyond the selected domains. Another limitation
is the time frame. We selected papers from the recent years (2020-2023) as our focus
is on exploring the latest advancements in this field. Therefore, there is a possibility
that other potential applications of CV in healthcare may have been overlooked. For
this review study, we considered only peer-reviewed articles from specific databases.
Although exploring other databases might reveal additional insights into recent developments
of CV in healthcare. Future work will include conducting a more extensive study over
a longer timeframe. This may reveal other potential applications of CV in the healthcare
sector.
REFERENCES
Karn, A. Artificial intelligence in computer vision. International Journal of Engineering
Applied Sciences and Technology. 2021.
Lee, M.K. Compound computer vision workflow for efficient and automated immunohistochemical
analysis of whole slide images. Journal of Clinical Pathology 2022.
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. nature 2015, 521, 436-444.
In\'{e}s, A. Biomedical image classification made easier thanks to transfer and semi-supervised
learning. Computer methods and programs in biomedicine 2020, 198, 105782.
Liu, T.; Siegel, E.; Shen, D. Deep learning and medical image analysis for COVID-19
diagnosis and prediction. Annual review of biomedical engineering 2022, 24, 179-201.
Thevenot, J. A Survey on Computer Vision for Assistive Medical Diagnosis From Faces.
IEEE Journal of Biomedical and Health Informatics 2018, 22, 1497-1511.
Kermany, D.S.; Goldbaum, M.; Cai, W.; Valentim, C.C.; Liang, H.; Baxter, S.L.; McKeown,
A.; Yang, G.; Wu, X.; Yan, F.; et al. Identifying medical diagnoses and treatable
diseases by image-based deep learning. cell 2018, 172, 1122-1131.
Khan, Z.A.; Beghdadi, A.; Cheikh, F.A.; Kaaniche, M.; Pelanis, E.; Palomar, R.; Fretland,
\AA{}.A.; Edwin, B.; Elle, O.J. Towards a video quality assessment based framework
for enhancement of laparoscopic videos. In Proceedings of the Medical Imaging 2020:
Image Perception, Observer Performance, and Technology Assessment. SPIE, 2020, Vol.
11316, pp. 129-136.
Bah, S.M.; Ming, F. An improved face recognition algorithm and its application in
attendance management system. Array 2020, 5, 100014.
Cut, itoi, A.C. Remote Patient Monitoring Systems, Wearable Internet of Medical Things
Sensor Devices, and Deep Learning-based Computer Vision Algorithms in COVID-19 Screening,
Detection, Diagnosis, and Treatment. American Journal of Medical Research 2022, 9,
129-144.
Xing, F.; Xie, Y.; Su, H.; Liu, F.; Yang, L. Deep learning in microscopy image analysis:
A survey. IEEE transactions on neural networks and learning systems 2017, 29, 4550-4568.
Hanna, M.G.; Ahmed, I.; Nine, J.; Prajapati, S.; Pantanowitz, L. Augmented reality
technology using Microsoft HoloLens in anatomic pathology. Archives of pathology &
laboratory medicine 2018, 142, 638-644.
Godinez, W.J.; Hossain, I.; Lazic, S.E.; Davies, J.W.; Zhang, X. A multi-scale convolutional
neural network for phenotyping high-content cellular images. Bioinformatics 2017,
33, 2010-2019.
Chan, M.; Est\`{e}ve, D.; Fourniols, J.Y.; Escriba, C.; Campo, E. Smart wearable systems:
Current status and future challenges. Artificial intelligence in medicine 2012, 56,
137-156.
Esteva, A.; Kuprel, B.; Novoa, R.A.; Ko, J.; Swetter, S.M.; Blau, H.M.; Thrun, S.
Dermatologist- level classification of skin cancer with deep neural networks. nature
2017, 542, 115-118.
Ogutu, H.; El Archi, Y.; D\'{e}nes D\'{a}vid, L. Current trends in sustainable organization
management: A bibliometric analysis. Oeconomia Copernicana 2023, 14, 11-45.
Randolph, J. A guide to writing the dissertation literature review. Practical assessment,
research, and evaluation 2009, 14, 13.
Afridi, T.H.; Alam, A.; Khan, M.N.; Khan, J.; Lee, Y.K. A multimodal memes classification:
A survey and open research issues. In Proceedings of the Innovations in Smart Cities
Applications Volume 4: The Proceedings of the 5th International Conference on Smart
City Applications. Springer, 2021, pp. 1451-1466.
Bai, X. Virtual garden landscape planning based on FPGA and GIS platform. Microprocessors
and Microsystems 2020, 79, 103314.
Saeedi, R.; Norgaard, S.; Gebremedhin, A.H. A closed-loop deep learning architecture
for robust activity recognition using wearable sensors. In Proceedings of the 2017
IEEE International Conference on Big Data (Big Data). IEEE, 2017, pp. 473-479.
Amin, H.; Darwish, A.; Hassanien, A.E.; Soliman, M. End-to-end deep learning model
for corn leaf disease classification. IEEE Access 2022, 10, 31103-31115.
Kang, R.; Park, B.; Ouyang, Q.; Ren, N. Rapid identification of foodborne bacteria
with hyperspectral microscopic imaging and artificial intelligence classification
algorithms. Food Control 2021, 130, 108379.
Jung, H.; Lodhi, B.; Kang, J. An automatic nuclei segmentation method based on deep
con- volutional neural networks for histopathology images. BMC Biomedical Engineering
2019, 1, 1-12.
Kim, H.; Cho, J.; Jung, Y.; Lee, S.; Jung, Y. Area-efficient vision-based feature
tracker for autonomous hovering of unmanned aerial vehicle. Electronics 2020, 9, 1591.
Sinanog˘ lu, N. Fully Convolutional Bottleneck Siamese Networks for Change Detection
in Satellite Images. 2022.
Lobe, B.; Morgan, D.; Hoffman, K.A. Qualitative data collection in an era of social
distancing. International journal of qualitative methods 2020, 19, 1609406920937875.
Khan, Z.; Shubham, T.; Arya, R.K. Skin Cancer Detection Using Computer Vision. In
Proceed- ings of the International Conference on Computational Techniques and Applications.
Springer, 2021, pp. 3-11.
Gouda, W.; Sama, N.U.; Al-Waakid, G.; Humayun, M.; Jhanjhi, N.Z. Detection of skin
cancer based on skin lesion images using deep learning. In Proceedings of the Healthcare.
MDPI, 2022, Vol. 10, p. 1183.
Zheng, J.; Lin, D.; Gao, Z.; Wang, S.; He, M.; Fan, J. Deep learning assisted efficient
AdaBoost algorithm for breast cancer detection and early diagnosis. IEEE Access 2020,
8, 96946-96954.
Yoon, S.; Chandra, A.; Vahedi, G. Stripenn detects architectural stripes from chromatin
confor- mation data using computer vision. Nature Communications 2022, 13, 1602.
Akbarian, S.; Cawston, T.; Moreno, L.; Patel, S.; Allen, V.; Dolatabadi, E. A computer
vision approach to combat lyme disease. arXiv preprint arXiv:2009.11931 2020.
Ankireddy, S. Assistive diagnostic tool for brain tumor detection using computer vision.
In Proceedings of the 2020 IEEE MIT Undergraduate Research Technology Conference (URTC).
IEEE, 2020, pp. 1-4.
de Almeida, J.G.; Gudgin, E.; Besser, M.; Dunn, W.G.; Cooper, J.; Haferlach, T.; Vassiliou,
G.S.; Gerstung, M. Computational analysis of peripheral blood smears detects disease-associated
cytomorphologies. Nature Communications 2023, 14, 4378.
Eguida, M.; Rognan, D. A computer vision approach to align and compare protein cavities:
application to fragment-based drug design. Journal of Medicinal Chemistry 2020, 63,
7127-7142.
Ting, H.W.; Chung, S.L.; Chen, C.F.; Chiu, H.Y.; Hsieh, Y.W. A drug identification
model developed using deep learning technologies: experience of a medical center in
Taiwan. BMC health services research 2020, 20, 1-9.
Verma, V.K.; Kansal, V.; Bhatnagar, P. Patient Identification using Facial Recognition.
In Proceedings of the 2020 International Conference on Futuristic Technologies in
Control Systems & Renewable Energy (ICFCR). IEEE, 2020, pp. 1-7.
Dong, Y.; Yang, G.; Yin, Y. Time Lab’s Approach to the Challenge on Computer Vision
for Remote Physiological Measurement. In Proceedings of the Proceedings of the IEEE/CVF
International Conference on Computer Vision, 2021, pp. 2398-2403.
Rahman, M.M.; Cook, J.; Taebi, A. Non-contact heart vibration measurement using computer
vision-based seismocardiography. Scientific Reports 2023, 13, 11787.
Morinan, G.; Dushin, Y.; Sarapata, G.; Rupprechter, S.; Peng, Y.; Girges, C.; Salazar,
M.; Milabo, C.; Sibley, K.; Foltynie, T.; et al. Computer vision quantification of
whole-body Parkinsonian bradykinesia using a large multi-site population. npj Parkinson’s
Disease 2023, 9, 10.
\includegraphics[width=1\textwidth]{3.docx.tmp/word/media/image5.jpeg}Zainuddin, A.A.;
Superamaniam, S.; Andrew, A.C.; Muraleedharan, R.; Rakshys, J.; Miriam, J.; Bostomi,
M.A.S.M.; Rais, A.M.A.; Khalidin, Z.; Mansor, A.F.; et al. Patient monitoring system
using computer vision for emotional recognition and vital signs detection. In Proceedings
of the 2020 IEEE student conference on research and development (SCOReD). IEEE, 2020,
pp. 22-27.
Zhang, M.; Cheng, X.; Copeland, D.; Desai, A.; Guan, M.Y.; Brat, G.A.; Yeung, S. Using
computer vision to automate hand detection and tracking of surgeon movements in videos
of open surgery. In Proceedings of the AMIA Annual symposium proceedings. American
Medical Informatics Association, 2020, Vol. 2020, p. 1373.
Handelman, A.; Keshet, Y.; Livny, E.; Barkan, R.; Nahum, Y.; Tepper, R. Evaluation
of suturing performance in general surgery and ocular microsurgery by combining computer
vision-based software and distributed fiber optic strain sensors: a proof-of-concept.
International Journal of Computer Assisted Radiology and Surgery 2020, 15, 1359-1367.
Cartucho, J.; Tukra, S.; Li, Y.; S. Elson, D.; Giannarou, S. VisionBlender: a tool
to efficiently generate computer vision datasets for robotic surgery. Computer Methods
in Biomechanics and Biomedical Engineering: Imaging & Visualization 2021, 9, 331-338.
Luongo, F.; Hakim, R.; Nguyen, J.H.; Anandkumar, A.; Hung, A.J. Deep learning-based
computer vision to recognize and classify suturing gestures in robot-assisted surgery.
Surgery 2021, 169, 1240-1244.
Bah\c{c}eci S¸ims¸ek, I˙.; S¸irolu, C. Analysis of surgical outcome after upper eyelid
surgery by computer vision algorithm using face and facial landmark detection. Graefe’s
Archive for Clinical and Experimental Ophthalmology 2021, 259, 3119-3125.
Mascagni, P.; Alapatt, D.; Laracca, G.G.; Guerriero, L.; Spota, A.; Fiorillo, C.;
Vardazaryan, A.;Quero, G.; Alfieri, S.; Baldari, L.; et al. Multicentric validation
of EndoDigest: a computer vision platform for video documentation of the critical
view of safety in laparoscopic cholecystectomy. Surgical Endoscopy 2022, 36, 8379-8386.
Mahmudul Islam received his M.S. degree in Software Engineering from Independent University,
Bangladesh (IUB) in 2023. Currently he is working as a research officer at IUB.
Nasim Mahmud Nayan is a research assistant at Center for Computational and Data Sciences
(CCDS), IUB. He is also a graduate student at the University of Information Technology
and Sciences.
Ashraful Islam, PhD is an assistant professor of Computer Science and Engineering
at IUB. He also holds the position of Co-Director of HCI Wing at Center for Computational
and Data Sciences (CCDS), IUB.
Sankar Sikder received his M.S. in Mathematics from University of Louisiana at Lafayette
(UL Lafayette) in 2020. Currently, he is a PhD candidate at the Department of Mathematics,
UL Lafayette.
Masud Rana Rashel, PhD is a researcher and academic known for his contributions to
Renewable Energy and Embedded Systems. He is currently working as a Research Fellow
at the Instrumentation and Control Laboratory, Department of Mechatronics Engineering,
University of Évora, Portugal.
Md Zahangir Alam, PhD is an assistant professor of Computer Science and Engineering
at IUB. He also hold the position of Co-Director of AI-ML Wing at CCDS, IUB.