2016, Computer Vision and Convolutional Neural Networks

2016

Computer Vision

Convolutional Neural Networks

Object Detection in 2024: The Definitive Guide

Viso.ai

DECEMBER 3, 2023

This article will provide an introduction to object detection and provide an overview of the state-of-the-art computer vision object detection algorithms. Object detection is a key field in artificial intelligence, allowing computer systems to “see” their environments by detecting objects in visual images or videos.

Convolutional Neural Networks

Convolutional Neural Networks Computer Vision Neural Network Deep Learning

Faster R-CNNs

PyImageSearch

NOVEMBER 13, 2023

For example, image classification, image search engines (also known as content-based image retrieval, or CBIR), simultaneous localization and mapping (SLAM), and image segmentation, to name a few, have all been changed since the latest resurgence in neural networks and deep learning. 2015 ; Redmon and Farhad, 2016 ), and others.

Convolutional Neural Networks

Convolutional Neural Networks Neural Network Computer Vision Deep Learning

Join 15,000+

professionals

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Achieve High-Accuracy Results When Using LLMs

Maximizing Profit and Productivity: The New Era of AI-Powered Accounting

Relevance, Reach, Revenue: How to Turn Marketing Trends From Hype to High-Impact

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Trending Sources

Top Computer Vision Papers of All Time (Updated 2024)

Viso.ai

MARCH 12, 2024

Today’s boom in computer vision (CV) started at the beginning of the 21 st century with the breakthrough of deep learning models and convolutional neural networks (CNN). In this article, we dive into some of the most significant research papers that triggered the rapid development of computer vision.

Computer Vision

Computer Vision Convolutional Neural Networks Neural Network Deep Learning

Webinars

How to Achieve High-Accuracy Results When Using LLMs

Maximizing Profit and Productivity: The New Era of AI-Powered Accounting

Relevance, Reach, Revenue: How to Turn Marketing Trends From Hype to High-Impact

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Computer Vision in Autonomous Vehicle Systems

Viso.ai

DECEMBER 6, 2024

Computer vision is a key component of self-driving cars. In this article, we’ll elaborate on how computer vision enhances these cars. To accomplish this, they require two key components: machine learning and computer vision. The eyes of the automobile are computer vision models.

Computer Vision

Computer Vision Convolutional Neural Networks Neural Network Deep Learning

The Complete Guide to OpenPose in 2025

Viso.ai

OCTOBER 15, 2024

In the following, we will cover the following: Pose Estimation in Computer Vision What is OpenPose? provides the leading Computer Vision Platform, Viso Suite. Global organizations use it to develop, deploy, and scale all computer vision applications in one place. How does it work? How to Use OpenPose?

Convolutional Neural Networks

Convolutional Neural Networks Computer Vision Neural Network Deep Learning

4 Applications of Intelligent Waste Management [2025]

Viso.ai

OCTOBER 7, 2024

billion tons of municipal solid waste was generated globally in 2016 with experts predicting a steep rise to 3.40 This is where computer vision technology can help identify waste, separate it, and ensure its proper disposal. In this article, we will propose computer vision as an effective tool for waste management.

Convolutional Neural Networks

Convolutional Neural Networks Computer Vision Neural Network Robotics

Embed, encode, attend, predict: The new deep learning formula for state-of-the-art NLP models

Explosion

NOVEMBER 9, 2016

2016) introduce an attention mechanism that takes two sentence matrices, and outputs a single vector: Yang et al. 2016) introduce an attention mechanism that takes a single matrix and outputs a single vector. Interestingly, most NLP models usually favour quite shallow feed-forward networks. 2016) recently published so exciting.

Deep Learning

Deep Learning NLP Convolutional Neural Networks Neural Network

YOLOv11: A New Iteration of “You Only Look Once”

Viso.ai

OCTOBER 15, 2024

In the field of real-time object identification, YOLOv11 architecture is an advancement over its predecessor, the Region-based Convolutional Neural Network (R-CNN). Using an entire image as input, this single-pass approach with a single neural network predicts bounding boxes and class probabilities.

Computer Vision

Computer Vision Convolutional Neural Networks Neural Network Python

Google Research, 2022 & Beyond: Language, Vision and Generative Models

Google Research AI blog

JANUARY 18, 2023

I will begin with a discussion of language, computer vision, multi-modal models, and generative machine learning models. Over the next several weeks, we will discuss novel developments in research topics ranging from responsible AI to algorithms and computer systems to science, health and robotics. Let’s get started!

Computer Vision

Computer Vision Auto-classification Large Language Models Neural Network

The Magic of AI Art: Understanding Neural Style Transfer

Viso.ai

JULY 17, 2024

An image can be represented by the relationships between the activations of features detected by a convolutional neural network (CNN). Fast Style Transfer (2016) While the previous model produced decent results, it was computationally expensive and slow. Johnson et al. What is Perceptual Loss?

Convolutional Neural Networks

Convolutional Neural Networks Neural Network Computer Vision Deep Learning

YOLOv9: Advancements in Real-time Object Detection (2024)

Viso.ai

FEBRUARY 27, 2024

Farhadi, signifying a step forward in the real-time object detection space, outperforming its predecessor – the Region-based Convolutional Neural Network (R-CNN). It is a single-pass algorithm having only one neural network to predict bounding boxes and class probabilities using a full image as input. Divvala, R.

Neural Network

Neural Network Computer Vision Convolutional Neural Networks Python

YOLO Explained: From v1 to v11

Viso.ai

NOVEMBER 27, 2024

Object detection is a computer vision task that uses neural networks to localize and classify objects in images. Multiple machine-learning algorithms are used for object detection, one of which is convolutional neural networks (CNNs). Viso Suite is the end-to-End, No-Code Computer Vision Solution.

Explainability

Explainability Convolutional Neural Networks Neural Network Computer Vision

Multi-Modal Methods: Image Captioning (From Translation to Attention)

ML Review

JUNE 4, 2018

Recent Intersections Between Computer Vision and Natural Language Processing (Part Two) This is the second instalment of our latest publication series looking at some of the intersections between Computer Vision (CV) and Natural Language Processing (NLP). Attention can enable our inspection and debugging of networks.

Neural Network

Neural Network Convolutional Neural Networks Computer Vision Deep Learning

A Guide to YOLOv8 in 2024

Viso.ai

DECEMBER 18, 2023

YOLOv8 is the newest model in the YOLO algorithm series – the most well-known family of object detection and classification models in the Computer Vision (CV) field. offers the world’s leading end-to-end no-code Computer Vision Platform Viso Suite. Get a demo.

Computer Vision

Computer Vision Convolutional Neural Networks Neural Network Algorithm

You are probably doing Medical Imaging AI the wrong way.

Mlearning.ai

JUNE 16, 2023

The ImageNet dataset, featuring natural images, contains 14,197,122 annotated images organized in 1000 classes and is commonly used as a benchmark for many computer vision models⁸. Practitioners first trained a Convolutional Neural Network (CNN) to perform image classification on ImageNet (i.e. December 10, 2016.

Convolutional Neural Networks

Convolutional Neural Networks Computer Vision AI AI

YOLOv7: The Most Powerful Object Detection Algorithm (2023 Guide)

Viso.ai

MAY 20, 2023

The YOLOv7 algorithm is making big waves in the computer vision and machine learning communities. It requires several times cheaper hardware than other neural networks and can be trained much faster on small datasets without any pre-trained weights. The original YOLO object detector was first released in 2016.

Algorithm

Algorithm Computer Vision Deep Learning Neural Network

Foundation models: a guide

Snorkel AI

MARCH 1, 2023

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Radford et al. 2016) This paper introduced DCGANs, a type of generative model that uses convolutional neural networks to generate images with high fidelity. Attention Is All You Need Vaswani et al.

BERT

BERT Natural Language Processing Large Language Models NLP

YOLOX Explained: Features, Architecture and Applications

Viso.ai

MARCH 11, 2024

Object detection is a fundamental task in computer vision, and YOLOX plays a fair role in improving it. YOLO in 2015 became the first significant model capable of object detection with a single pass of the network. The previous approaches relied on Region-based Convolutional Neural Network (RCNN) and sliding window techniques.

Explainability

Explainability Convolutional Neural Networks Neural Network Computer Vision

Home Robots: the Stanford’s Roadmap Paper

Viso.ai

MAY 1, 2024

Deep learning and Convolutional Neural Networks (CNNs) have enabled speech understanding and computer vision on our phones, cars, and homes. Stanford University and panel researchers P. Stone and R. Brooks et al. Brooks et al. Moreover, they can answer any question and communicate naturally.

Robotics

Robotics Convolutional Neural Networks Computer Vision Deep Learning

Dude, Where’s My Neural Net? An Informal and Slightly Personal History

Lexalytics

APRIL 5, 2021

While working as an RA in the computer vision group, I had the opportunity to sit in a robotic Humvee as it used Pomerleau’s code to drive around the University of Massachusetts’ stadium.) The CNN was a 6-layer neural net with 132 convolution kernels and (don’t laugh!) Hinton (again!)

Neural Network

Neural Network Convolutional Neural Networks Natural Language Processing BERT

Mastering Visual Question Answering with Deep Learning and Natural Language Processing: A Pocket-friendly Guide

John Snow Labs

MAY 23, 2023

Visual question answering (VQA), an area that intersects the fields of Deep Learning, Natural Language Processing (NLP) and Computer Vision (CV) is garnering a lot of interest in research circles. Its proposed neural architecture can provide fairly accurate answers to natural language questions about images.

Natural Language Processing

Natural Language Processing Deep Learning NLP Convolutional Neural Networks

ML and NLP Research Highlights of 2020

Sebastian Ruder

JANUARY 19, 2021

More recently, contrastive learning gained popularity in self-supervised representation learning in computer vision and speech ( van den Oord, 2018 ; Hénaff et al., 2016 ; Webster et al., 8) Image Transformers The Vision Transformer ( Dosovitskiy et al., 2020 ; Wallace et al., 2020 ; Carlini et al.,

NLP

NLP ML Computer Vision Natural Language Processing

sense2vec reloaded: contextually-keyed word vectors

Explosion

NOVEMBER 21, 2019

In 2016 we trained a sense2vec model on the 2015 portion of the Reddit comments corpus, leading to a useful library and one of our most popular demos. spaCy’s default token-vector encoding settings are a depth 4 convolutional neural network with width 96, and hash embeddings with 2000 rows. assert doc[3:6].text

NLP

NLP Convolutional Neural Networks Neural Network Natural Language Processing

AI News Weekly - Issue #339: Next DeepMind's Algorithm To Eclipse ChatGPT - Jun 29th 2023

AI Weekly

JUNE 29, 2023

In the News Next DeepMind's Algorithm To Eclipse ChatGPT IN 2016, an AI program called AlphaGo from Google’s DeepMind AI lab made history by defeating a champion player of the board game Go. June 15, 2023 /PRNewswire/ -- Quantum Computing Inc. ("QCi" Powered by pluto.fi

Algorithm

Algorithm ChatGPT Convolutional Neural Networks Robotics

Multi-Modal Methods: Visual Speech Recognition (Lip Reading)

ML Review

MAY 3, 2018

Recent Intersections Between Computer Vision and Natural Language Processing (Part One) This is the first instalment of our latest publication series looking at some of the intersections between Computer Vision (CV) and Natural Language Processing (NLP). 2016) — “ LipNet: End-to-End Sentence-level Lipreading.” [17]

Neural Network

Neural Network Computer Vision Deep Learning NLP

The 11 Top AI Influencers to Watch in 2024 (Guide)

Viso.ai

DECEMBER 21, 2023

Over the past decade, the field of computer vision has experienced monumental artificial intelligence (AI) breakthroughs. This blog will introduce you to the computer vision visionaries behind these achievements. Viso Suite is the end-to-End, No-Code Computer Vision Solution.

Convolutional Neural Networks

Convolutional Neural Networks Neural Network Computer Vision Deep Learning

Google builds UniAR, AirbnB uses ViTs!

Bugra Akyildiz

NOVEMBER 17, 2024

This uses advanced computer vision techniques, specifically a Vision Transformer model, to analyze and organize photos of properties. Vision Transformers(ViT) ViT is a type of machine learning model that applies the transformer architecture, originally developed for natural language processing, to image recognition tasks.

Convolutional Neural Networks

Convolutional Neural Networks Metadata Python Computer Vision

Artificial Intelligence Zone

Object Detection in 2024: The Definitive Guide

Faster R-CNNs

Webinars

Trending Sources

Top Computer Vision Papers of All Time (Updated 2024)

Webinars

Computer Vision in Autonomous Vehicle Systems

The Complete Guide to OpenPose in 2025

4 Applications of Intelligent Waste Management [2025]

Embed, encode, attend, predict: The new deep learning formula for state-of-the-art NLP models

YOLOv11: A New Iteration of “You Only Look Once”

Google Research, 2022 & Beyond: Language, Vision and Generative Models

The Magic of AI Art: Understanding Neural Style Transfer

YOLOv9: Advancements in Real-time Object Detection (2024)

YOLO Explained: From v1 to v11

Multi-Modal Methods: Image Captioning (From Translation to Attention)

A Guide to YOLOv8 in 2024

You are probably doing Medical Imaging AI the wrong way.

YOLOv7: The Most Powerful Object Detection Algorithm (2023 Guide)

Foundation models: a guide

YOLOX Explained: Features, Architecture and Applications

Home Robots: the Stanford’s Roadmap Paper

Dude, Where’s My Neural Net? An Informal and Slightly Personal History

Mastering Visual Question Answering with Deep Learning and Natural Language Processing: A Pocket-friendly Guide

ML and NLP Research Highlights of 2020

sense2vec reloaded: contextually-keyed word vectors

AI News Weekly - Issue #339: Next DeepMind's Algorithm To Eclipse ChatGPT - Jun 29th 2023

Multi-Modal Methods: Visual Speech Recognition (Lip Reading)

The 11 Top AI Influencers to Watch in 2024 (Guide)

Google builds UniAR, AirbnB uses ViTs!

Stay Connected