Astronaut loading animation Circular loading bar

Try : Insurtech, Application Development

AgriTech(1)

Augmented Reality(20)

Clean Tech(5)

Customer Journey(12)

Design(35)

Solar Industry(6)

User Experience(55)

Edtech(10)

Events(34)

HR Tech(2)

Interviews(10)

Life@mantra(11)

Logistics(5)

Strategy(17)

Testing(9)

Android(47)

Backend(30)

Dev Ops(7)

Enterprise Solution(27)

Technology Modernization(2)

Frontend(28)

iOS(43)

Javascript(15)

AI in Insurance(35)

Insurtech(63)

Product Innovation(49)

Solutions(19)

E-health(9)

HealthTech(22)

mHealth(5)

Telehealth Care(4)

Telemedicine(5)

Artificial Intelligence(132)

Bitcoin(8)

Blockchain(19)

Cognitive Computing(7)

Computer Vision(8)

Data Science(17)

FinTech(50)

Banking(7)

Intelligent Automation(26)

Machine Learning(47)

Natural Language Processing(14)

expand Menu Filters

How we used RetinaNet for dense shape detection in live imagery

Convolutional Neural Networks (CNN) have come a long way in conveniently identifying objects in images and videos. Networks like VGG19, ResNet, YOLO, SSD, R-CNN, DensepathNet, DualNet, Xception, Inception, PolyNet, MobileNet, and many more have evolved over time. Their range of applications lies in detecting space availability in a parking lot, satellite image analysis to track ships and agricultural output, radiology, people count, detecting words in vehicle license plates and storefronts, circuits/machinery fault analysis, medical diagnosis, etc.

Facebook AI Research (FAIR) has recently published RetinaNet architecture which uses Feature Pyramid Network (FPN) with ResNet. This architecture demonstrates higher accuracy in situations where speed is not really important. RetinaNet is built on top of FPN using ResNet.

Comparing tradeoff between speed and accuracy of different CNNs

Google offers benchmark comparison to calculate tradeoff between speed and accuracy of various networks using MS COCO dataset to train the models in TensorFlow. It gives us a benchmark to understand the best model that provides a balance between speed and accuracy. According to researchers, Faster R-CNN is more accurate, whereas R-FCN and FCN show better inference time (i.e. their speed is higher). Inception and ResNet are implementations of Faster R-CNN. MobileNet is an implementation of SSD.

Faster R-CNN implementations show an overall mAP (mean average precision) of around 30, which is highest for feature extraction. And, at the same time, its accuracy is also highest at around 80.5%. MobileNet R-FCN implementation has a lower mAP of around 15. Therefore, its accuracy drops down to about 71.5%. 

Thus, we can say — SSD implementations work best for detecting larger objects whereas, Faster R-CNN and R-FCN are better at detecting small objects.

speed and accuracy of various CNNs

On the COCO dataset, Faster R-CNN has average mAP for IoU (intersection-over-union) from 0.5 to 0.95 (mAP@[0.5, 0.95]) as 21.9% . R-FCN has mAP of 31.5% . SSD300 and SSD512 have mAPs of 23.2 and 26.8 respectively . YOLO-V2 is at 21.6% whereas YOLO-V3 is at 33% . FPN delivers 33.9% . RetinaNet stands highest at 40.8%.

RetinaNet- AP vs speed comparison
The two variations of RetinaNet are compared above for AP vs speed (ms) for inference.

One-stage detector vs two-stage detectors for shape detection

A One-stage detector scans for candidate objects sampled for around 100000 locations in the image that densely covers the spatial extent. This does not let the class balance between background and foreground. 

A Two-stage detector first narrows down the number of candidate objects on up to 2000 locations and separates them from the background in the first stage. It then classifies each candidate object in the second stage, thus managing the class balance. But, because of the smaller number of locations in the sample, many objects might escape detection. 

Faster R-CNN is an implementation of the two-stage detector. RetinaNet, an implementation of one stage detector addresses this class imbalance and efficiently detects all objects.

Focal Loss: a new loss function

This function focuses on training on hard negatives. It is defined as-

focal loss function

Where,

focal loss function

and p = sigmoid output score.

The greeks are hyperparameters.


When a sample classification is inappropriate and pₜ is small, it does not affect the loss. Gamma is a focusing parameter and adjusts the rate at which the easy samples are down-weighted. Samples get down-weighted when their classification is inappropriate and pₜ is close to 1. When gamma is 0, the focal loss is close to the cross-entropy loss. Upon increasing gamma, the effect of modulating factor also increases.

RetinaNet Backbone

The new loss function called Focal loss increases the accuracy significantly. Essentially it is a one-stage detector Feature Pyramid Network with Focal loss replacing the cross-entropy loss. 

Hard negative mining in a single shot detector and Faster R-CNN addresses the class imbalance by downsampling the dominant samples. On the contrary, RetinaNet addresses it by changing the weights in the loss function. The following diagram explains the architecture.

RetinaNet architecture

Here, deep feature extraction uses ResNet. Using FPN on top of ResNet further helps in constructing a multi-scale feature pyramid from a single resolution image. FPN is fast to compute and works efficiently on multiscale.

Results

We used ResNet50-FPN pre-trained on MS COCO to identify humans in the photo. The threshold is set above a score of 0.5. The following images show the result with markings and confidence values.

Dense shape detection
Human shape detection

We further tried to detect other objects like chairs.

RetinaNet object detection

Conclusion: It’s great to know that training on the COCO dataset can detect objects from unknown scenes. The object detection in the scenes took 5-7 seconds. So far, we have put filters of human or chair in results. RetinaNet can detect all the identifiable objects in the scene.

Multiple objects detection using RetinaNet

The different objects detected with their score are listed below-

human0.74903154
human0.7123633
laptop0.69287986
human0.68936586
bottle0.67716646
human0.66410005
human0.5968385
chair0.5855772
human0.5802317
bottle0.5792091
chair0.5783555
chair0.538948
human0.52267283

Next, we will be interested in working on a model good in detecting objects in the larger depth of the image, which the current ResNet50-FPN could not do.

About author: Harsh Vardhan is a Tech Lead in the Development Department of Mantra Labs. He is integral to AI-based development and deployment of projects at Mantra Labs.

General FAQs

What is RetinaNet?

RetinaNet is a type of CNN (Convolutional Neural Network) architecture published by Facebook AI Research also known as FAIR. It uses the Feature Pyramid Network (FPN) with ResNet. RetinaNet is widely used for detecting objects in live imagery (real-time monitoring systems). This architecture demonstrates a high-level of accuracy, but with a little compromise in speed. In the experiment we conducted, it took 5-7 seconds for object detection in live scenes.Dense shape detection - RetinaNet

What is RetinaNet Model?

RetinaNet model comprises of a backbone network and two task-specific sub-networks. The backbone network is a Feature Pyramid Network (FPN) built on ResNet. It is responsible for computing a convolution feature (object) from the input imagery. The two subnetworks are responsible for the classification and box regression, i.e. one subnet predicts the possibility of the object being present at a particular spatial location and the other subnetwork outputs the object location for the anchor box.

What is Focal Loss?

The focal loss function focuses on training on hard negatives. In other words, the focal loss function is an algorithm for improving Average Precision (AP) in single-stage object detectors. It is defined as-RetinaNet focal loss function

What is SSD Network?

Single Shot Detector (SSD) can detect multiple objects in an image in a single shot, hence the name. 
The beauty of SSD networks is that it predicts the boundaries itself and has no assigned region proposal network. SSD networks can predict the boundary boxes and classes from feature maps in just one pass by using small convolutional filters.

Glossary of Terms related to convolutional neural networks

CNN

Deep Learning uses Convolutional neural networks (CNN) for analyzing visual imagery. It consists of an input and output layer and multiple intermediate layers. In CNN programming, the input is called a tensor, which is usually an image or a video frame. It passes through the convolutional layer forming an abstract feature map identifying different shapes.

R-CNN

The process of combining region proposals with CNN is called as R-CNN. Region proposals are the smaller parts of the original image that have a probability of containing the desired shape/object. The R-CNN algorithm creates several region proposals and each of them goes to the CNN network for better dense shape detection.

ResNet

Residual Neural Network (ResNet) utilizes skip connections to jump over some layers. Classical CNNs do not perform when the depth of the network increases beyond a certain threshold. Most of the ResNet models are implemented with double or triple layer skips with batch normalization in between. ResNet helps in the training of deeper networks.

YOLO

You only look once (YOLO) is a real-time object detection system. It is faster than most other neural networks for detecting shapes and objects. Unlike other systems, it applies neural network functions to the entire image, optimizing the detection performance.

FAIR

It is Facebook’s AI Research arm for understanding the nature of intelligence and creating intelligent machines. The main research areas at FAIR include Computer Vision, Conversational AI, Integrity, Natural Language Processing, Ranking and Recommendations, System Research, Theory, Speech & Audio, and Human & Machine Intelligence.

FPN

Feature Pyramid Network (FPN) is a feature extractor designed for achieving speed and accuracy in detecting objects or shapes. It generates multiple feature map layers with better quality information for object detection.

COCO Dataset

Common Objects in Context (COCO) is a large-scale dataset for detecting, segmenting, and captioning any object. 

FCN

Fully Convolutional Network (FCN) transforms the height and width of the intermediate layer (feature map) back to the original size so that predictions have a one-to-one correspondence with the input image. 

R-FCN

R-FCN corresponds to a region-based fully convolutional network. It is mainly used for feature detection. R-FCN comprises region-based feature maps that are independent of region proposals (ROI) and carry computation outside of ROIs. It is much simpler and about 20 times faster than R-CNN. 

TensorFlow

It is an open-source software library developed by Google Brain for a range of dataflow and differential programming applications. It is also useful in neural network programming. 

Also read – How are Medical Images shared among Healthcare Enterprises



Cancel

Knowledge thats worth delivered in your inbox

CX Innovations in Healthcare: Doctor Engagement Strategies in the USA

The importance of customer experience (CX) in healthcare cannot be overstated. A positive CX is crucial not only for patient satisfaction but also for the overall efficiency and success of healthcare providers. One critical aspect of CX in healthcare is doctor engagement, which refers to the strategies and practices used to involve doctors in the healthcare delivery process actively.

Doctor engagement is essential for several reasons. Firstly, engaged doctors are more likely to be committed to their work, leading to better patient care and outcomes. Secondly, effective doctor engagement can improve communication and collaboration among healthcare professionals, enhancing the quality of healthcare services. Finally, engaged doctors can provide valuable insights and feedback, helping healthcare organizations to continuously improve their services and adapt to changing patient needs.

State of Doctor Engagement: Pre-Innovation Era

Traditionally, doctor engagement in healthcare was primarily focused on face-to-face interactions and personal relationships. Doctors were engaged through regular meetings, conferences, and direct communication with hospital administrators and other healthcare staff. While these methods were effective to some extent, they had several limitations.

One major limitation was the lack of scalability. As healthcare organizations grew and the number of doctors increased, it became challenging to maintain the same level of personal engagement with each doctor. Additionally, traditional engagement methods were often time-consuming and resource-intensive, making them unsustainable in the long term.

Another limitation was the lack of data-driven insights. Traditional engagement practices relied heavily on anecdotal evidence and personal experiences, which did not always provide a complete or accurate picture of doctor engagement levels. This made it difficult for healthcare organizations to measure the effectiveness of their engagement strategies and identify areas for improvement.

Furthermore, the pre-innovation era of doctor engagement often lacked customization and flexibility. Engagement strategies were typically one-size-fits-all, failing to account for the diverse needs and preferences of individual doctors. This lack of personalization could lead to disengagement among doctors who felt that their unique contributions and perspectives were not being valued.

Emerging Problems and the Need for Innovation

As the healthcare industry continued to evolve, several emerging problems highlighted the need for innovation in doctor engagement strategies. One significant issue was the increasing complexity of healthcare delivery. With advancements in medical technology and the growing diversity of patient needs, doctors were required to navigate more complex treatment options and care protocols. Traditional engagement methods often fell short in providing the support and resources needed to manage this complexity effectively.

Another problem was the rising demand for healthcare services, fueled by factors such as an aging population and the prevalence of chronic diseases. This increased demand put pressure on doctors, leading to burnout and dissatisfaction. Without effective engagement strategies, healthcare organizations struggle to retain skilled doctors and maintain high levels of patient care.

The digital transformation of healthcare also posed challenges for doctor engagement. The adoption of electronic health records (EHRs), telemedicine, and other digital tools required doctors to adapt to new ways of working. However, the lack of proper training and support for these digital tools often led to frustration and resistance among doctors, hindering their engagement.

Moreover, the shift towards value-based care, which focuses on patient outcomes rather than the volume of services provided, required a more collaborative approach to healthcare. Traditional doctor engagement methods were not always conducive to fostering teamwork and shared decision-making, making it difficult to align doctors with the goals of value-based care.

These emerging problems underscored the need for innovative solutions that could address the changing dynamics of healthcare delivery and support effective doctor engagement in the modern era.

Innovative Solutions: Transforming Doctor Engagement

In response to these challenges, a range of innovative solutions emerged to transform doctor engagement in healthcare. One key innovation was the development of digital platforms and tools designed specifically for doctor engagement. These platforms provided a centralized hub for communication, collaboration, and access to resources, making it easier for doctors to connect with their peers and stay informed about the latest developments in their field.

Another significant innovation was the use of data analytics and artificial intelligence (AI) in doctor engagement. By analyzing data on doctor behavior, preferences, and performance, healthcare organizations could gain insights into what drives doctor engagement and tailor their strategies accordingly. AI-powered tools could also help identify patterns and trends in doctor engagement, enabling proactive interventions to prevent disengagement.

Gamification techniques were also applied to doctor engagement, leveraging the principles of game design to make engagement activities more interactive and rewarding. For example, doctors could earn points or badges for participating in training sessions, contributing to research, or achieving certain performance metrics. This approach helped to motivate doctors and make engagement more enjoyable.

In addition, there was a growing emphasis on personalized engagement strategies that recognized the individual needs and preferences of doctors. Personalized communication, tailored training programs, and flexible engagement options allowed doctors to engage in ways that suited their unique circumstances and preferences.

These innovative solutions represented a significant shift in how healthcare organizations approached doctor engagement. By leveraging technology, data, and personalization, they could create more effective and sustainable engagement strategies that address the challenges of modern healthcare delivery.

To illustrate the impact of these innovative solutions, let’s examine some case studies of healthcare organizations that have successfully implemented new doctor engagement strategies:

Digital Collaboration Platform

A large hospital system introduced a digital collaboration platform for its doctors. This platform allowed physicians to easily communicate with each other, share knowledge, and access patient information securely. As a result, the hospital saw improved coordination among doctors, leading to better patient outcomes and increased doctor satisfaction. A real-world example can be given of Connect2Clinic, a doctors’ portal developed by Mantra Labs for Alkem Labs. The solution allows doctors to manage their patients efficiently with lots of handy features and effectively run operations. It is a complete clinic management solution.

AI-Driven Feedback Tool

Another healthcare provider implemented an AI-driven tool that collected and analyzed feedback from doctors in real time. This tool helped identify areas for improvement in hospital operations and doctor support services. By addressing these issues promptly, the healthcare provider was able to enhance doctor engagement and reduce turnover rates.

Personalized Learning Programs

A specialty clinic developed personalized learning programs for its doctors, offering courses and resources tailored to their interests and career goals. This approach led to higher participation rates in training programs and a more engaged medical staff who felt valued and supported in their professional development.

Challenges and Considerations in Implementing Innovations

While innovative solutions for doctor engagement offer numerous benefits, healthcare organizations may encounter challenges in their implementation. Here are some key considerations:

  1. Resistance to Change: Doctors, like any other professionals, may resist new technologies or processes. Addressing concerns, providing adequate training, and demonstrating the value of innovations are crucial steps in overcoming resistance.
  2. Integration with Existing Systems: New engagement tools must seamlessly integrate with existing healthcare systems, such as EHRs, to avoid disruption and ensure smooth operation.
  3. Data Privacy and Security: With the increased use of digital platforms, protecting patient and doctor data is paramount. Healthcare organizations must adhere to strict data privacy regulations and ensure robust security measures are in place.
  4. Cost and Resource Allocation: Implementing new technologies can be costly. Organizations must carefully plan their budgets and resources to support the adoption of innovative engagement strategies.
  5. Measuring Impact: It’s essential to have metrics in place to evaluate the effectiveness of engagement initiatives. Regular monitoring and adjustment of strategies based on data are necessary for long-term success.

Future of Doctor Engagement in Healthcare

Looking ahead, the future of doctor engagement in healthcare is likely to be shaped by ongoing technological advancements and evolving healthcare needs. Here are some potential trends:

  1. Increased Use of Telemedicine: The COVID-19 pandemic has accelerated the adoption of telemedicine. This trend is expected to continue, offering new opportunities for engaging doctors remotely.
  2. Personalized Engagement Platforms: As technology advances, we can expect more sophisticated platforms that offer personalized engagement experiences for doctors, tailored to their individual needs and preferences.
  3. Collaborative Healthcare Ecosystems: The future may see more integrated and collaborative healthcare ecosystems, where doctors, patients, and other stakeholders are closely connected through digital platforms, enhancing engagement and communication.
  4. Focus on Well-being: With growing awareness of doctor burnout, future engagement strategies may place a greater emphasis on supporting doctors’ well-being and work-life balance.
  5. Leveraging AI and Machine Learning: These technologies will continue to play a significant role in analyzing engagement data, predicting trends, and providing insights for improving doctor engagement strategies.

As healthcare continues to evolve, staying ahead of these trends and adapting engagement strategies accordingly will be crucial for healthcare organizations seeking to foster a highly engaged and motivated medical workforce.

Doctor engagement is a critical component of delivering high-quality healthcare. As the healthcare landscape evolves, so too must the strategies for engaging doctors. The innovations discussed in this blog, from digital collaboration platforms to personalized learning programs, offer promising solutions to the challenges of doctor engagement in the modern era.

The success stories and data presented highlight the tangible benefits of these innovative strategies, including improved patient outcomes, increased doctor satisfaction, and enhanced operational efficiency. However, healthcare organizations must navigate challenges such as resistance to change, data privacy concerns, and the integration of new technologies with existing systems.

Cancel

Knowledge thats worth delivered in your inbox

Loading More Posts ...
Go Top
ml floating chatbot