Remove 2016 Remove Convolutional Neural Networks Remove Explainability
article thumbnail

Faster R-CNNs

PyImageSearch

You’ll typically find IoU and mAP used to evaluate the performance of HOG + Linear SVM detectors ( Dalal and Triggs, 2005 ), Convolutional Neural Network methods, such as Faster R-CNN ( Girshick et al., 2015 ; Redmon and Farhad, 2016 ), and others. 2015 ), SSD ( Fei-Fei et al., 2015 ; He et al., MobileNets ).

article thumbnail

YOLO Explained: From v1 to v11

Viso.ai

Multiple machine-learning algorithms are used for object detection, one of which is convolutional neural networks (CNNs). YOLOv1 The Original Before introducing YOLO object detection, researchers used convolutional neural network (CNN) based approaches like R-CNN and Fast R-CNN.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

YOLOX Explained: Features, Architecture and Applications

Viso.ai

YOLO in 2015 became the first significant model capable of object detection with a single pass of the network. The previous approaches relied on Region-based Convolutional Neural Network (RCNN) and sliding window techniques. The post YOLOX Explained: Features, Architecture and Applications appeared first on viso.ai.

article thumbnail

GoogLeNet Explained: The Inception Model that Won ImageNet

Viso.ai

However, GoogLeNet demonstrated by using the inception module that depth and width in a neural network could be increased without exploding computations. GooLeNet – source Historical Context The concept of Convolutional Neural Networks ( CNNs ) isn’t new.

article thumbnail

Embed, encode, attend, predict: The new deep learning formula for state-of-the-art NLP models

Explosion

Over the last six months, a powerful new neural network playbook has come together for Natural Language Processing. This post explains the components of this new approach, and shows how they’re put together in two recent systems. 2016) introduce an attention mechanism that takes a single matrix and outputs a single vector.

article thumbnail

Multi-Modal Methods: Image Captioning (From Translation to Attention)

ML Review

These ideas also move in step with the explainability of results. If language grounding is achieved, then the network tells me how a decision was reached. In image captioning a network is not only required to classify objects, but instead to describe objects (including people and things) and their relations in a given image.

article thumbnail

The Magic of AI Art: Understanding Neural Style Transfer

Viso.ai

Output from Neural Style Transfer – source Neural Style Transfer Explained Neural Style Transfer follows a simple process that involves: Three images, the image from which the style is copied, the content image, and a starting image that is just random noise. Johnson et al. What is Perceptual Loss?