article thumbnail

YOLOv4: A Fast and Efficient Object Detection Model

Viso.ai

The YOLO Family of Models The first YOLO model was introduced back in 2016 by a team of researchers, marking a significant advancement in object detection technology. Convolution Layer: The concatenated feature descriptor is then passed through a Convolution Neural Network.

article thumbnail

Object Detection in 2024: The Definitive Guide

Viso.ai

Hence, rapid development in deep convolutional neural networks (CNN) and GPU’s enhanced computing power are the main drivers behind the great advancement of computer vision based object detection. Various two-stage detectors include region convolutional neural network (RCNN), with evolutions Faster R-CNN or Mask R-CNN.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Faster R-CNNs

PyImageSearch

You’ll typically find IoU and mAP used to evaluate the performance of HOG + Linear SVM detectors ( Dalal and Triggs, 2005 ), Convolutional Neural Network methods, such as Faster R-CNN ( Girshick et al., 2015 ; Redmon and Farhad, 2016 ), and others. 2015 ), SSD ( Fei-Fei et al., 2015 ; He et al., MobileNets ).

article thumbnail

The Complete Guide to OpenPose in 2025

Viso.ai

However, in recent years, human pose estimation accuracy achieved great breakthroughs with Convolutional Neural Networks (CNNs). The method won the COCO 2016 Keypoints Challenge and is popular for quality and robustness in multi-person settings. Pose Estimation is still a pretty new computer vision technology.

article thumbnail

Top Computer Vision Papers of All Time (Updated 2024)

Viso.ai

Today’s boom in computer vision (CV) started at the beginning of the 21 st century with the breakthrough of deep learning models and convolutional neural networks (CNN). GoogLeNet – Going Deeper with Convolutions (2014) The Google team (Christian Szegedy, Wei Liu, et al.) Find the SURF paper here.

article thumbnail

Embed, encode, attend, predict: The new deep learning formula for state-of-the-art NLP models

Explosion

2016) introduce an attention mechanism that takes two sentence matrices, and outputs a single vector: Yang et al. 2016) introduce an attention mechanism that takes a single matrix and outputs a single vector. 2016) presented a model that achieved 86.8% 2016) presented a model that achieved 86.8% 2016) HN-ATT 68.2

article thumbnail

4 Applications of Intelligent Waste Management [2025]

Viso.ai

billion tons of municipal solid waste was generated globally in 2016 with experts predicting a steep rise to 3.40 Object Detection : Computer vision algorithms, such as convolutional neural networks (CNNs), analyze the images to identify and classify waste types (i.e., As per the World Bank, 2.01 billion tons in 2050.