2014 and Computer Vision - Artificial Intelligence Zone

AI Acquisitions: Who’s Leading the Charge and Why?

Unite.AI

JANUARY 9, 2024

Apple prioritizes computer vision , natural language processing , voice recognition, and healthcare to enhance its products. Likewise, Microsoft strengthens its cloud and enterprise software through acquisitions in natural language processing , computer vision , and cybersecurity.

Natural Language Processing

Natural Language Processing Computer Vision Robotics AI

Allen Institute for Artificial Intelligence (AI2) Announces New CEO

Allen AI

JUNE 20, 2023

Founded in 2014, AI2 is the research institute created by the late philanthropist Paul G. Allen School of Computer Science & Engineering at University of Washington, Farhadi’s research impact has been globally recognized with several best paper awards at CVPR, NeruIPS, AAAI, NSF Career Award, and the Sloan Fellowship.

Artificial Intelligence

Artificial Intelligence Artificial Intelligence Computer Vision AI Researcher

Introduction of Neural Style Transfer – A Pioneer in Generative AI

Towards AI

APRIL 23, 2024

In computer vision, there is an area called domain adaptation or style transfer which generates a new image by mixing up specific attributes from different images. However, generative models is not a new term and it has come a long way since Generative Adversarial Network (GAN) was published in 2014 [1].

Computer Vision

Computer Vision Generative AI Deep Learning AI

Webinars

How to Achieve High-Accuracy Results When Using LLMs

Maximizing Profit and Productivity: The New Era of AI-Powered Accounting

Relevance, Reach, Revenue: How to Turn Marketing Trends From Hype to High-Impact

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Computer Vision in Autonomous Vehicle Systems

Viso.ai

DECEMBER 6, 2024

Computer vision is a key component of self-driving cars. In this article, we’ll elaborate on how computer vision enhances these cars. To accomplish this, they require two key components: machine learning and computer vision. The eyes of the automobile are computer vision models.

Computer Vision

Computer Vision Convolutional Neural Networks Neural Network Deep Learning

Crack Detection in Concrete

Towards AI

JULY 19, 2023

Photo by Maud CORREA on Unsplash Computer Vision Using Computer Vision Introduction Crack detection is crucial in monitoring the health of infrastructural buildings. Therefore, Now we conquer this problem of detecting the cracks using image processing methods, deep learning algorithms, and Computer Vision.

Computer Vision

Computer Vision Deep Learning Categorization Algorithm

Top Computer Vision Papers of All Time (Updated 2024)

Viso.ai

MARCH 12, 2024

Today’s boom in computer vision (CV) started at the beginning of the 21 st century with the breakthrough of deep learning models and convolutional neural networks (CNN). In this article, we dive into some of the most significant research papers that triggered the rapid development of computer vision.

Computer Vision

Computer Vision Convolutional Neural Networks Neural Network Deep Learning

Computer Vision Tasks (Comprehensive 2024 Guide)

Viso.ai

DECEMBER 6, 2023

Computer vision (CV) is a rapidly evolving area in artificial intelligence (AI), allowing machines to process complex real-world visual data in different domains like healthcare, transportation, agriculture, and manufacturing. Future trends and challenges Viso Suite is an end-to-end computer vision platform.

Computer Vision

Computer Vision Convolutional Neural Networks Neural Network Categorization

Overview of Important GAN Models & Applications

Towards AI

NOVEMBER 1, 2023

I offer data science mentoring sessions and long-term career mentoring: Generative adversarial networks (GANs) have revolutionized image synthesis since their introduction in 2014.

Neural Network

Neural Network Computer Vision Data Science AI

Computer Vision for Cultural Heritage Preservation: Unlocking the Past with Advanced Imaging…

Heartbeat

OCTOBER 9, 2023

Computer Vision for Cultural Heritage Preservation: Unlocking the Past with Advanced Imaging Technology Image Source: Technology Innovators Preserving our cultural legacy is critical because it allows us to remain in touch with our past, learn our roots, and appreciate humanity's rich history.

Computer Vision

Computer Vision Algorithm Deep Learning Machine Learning

AI Emotion Recognition and Sentiment Analysis (2025)

Viso.ai

OCTOBER 9, 2024

AI emotion recognition is a very active current field of computer vision research that involves facial emotion detection and the automatic assessment of sentiment from visual data and text analysis. provides the end-to-end computer vision platform Viso Suite. About us: Viso.ai

Convolutional Neural Networks

Convolutional Neural Networks Neural Network Computer Vision Emotion AI

Object Detection in 2024: The Definitive Guide

Viso.ai

DECEMBER 3, 2023

This article will provide an introduction to object detection and provide an overview of the state-of-the-art computer vision object detection algorithms. Object detection is a key field in artificial intelligence, allowing computer systems to “see” their environments by detecting objects in visual images or videos.

Convolutional Neural Networks

Convolutional Neural Networks Computer Vision Neural Network Deep Learning

Active learning is the future of generative AI: Here’s how to leverage it

Flipboard

FEBRUARY 28, 2023

More posts by this contributor 4 questions to ask before building a computer vision model During the past six months, we have witnessed some incredible developments in AI. These problems are why, despite the early promise and floods of investment, technologies like self-driving cars have been just one year away since 2014.

Generative AI

Generative AI ML Engineer Computer Vision Robotics

Digging Into Various Deep Learning Models

Pickl AI

JANUARY 26, 2025

Applications in Computer Vision CNNs dominate computer vision tasks such as object detection, image classification, and facial recognition. Introduced by Ian Goodfellow in 2014, GANs are designed to generate realistic data, such as images, videos, and audio, that mimic real-world datasets.

Deep Learning

Deep Learning Neural Network Convolutional Neural Networks Natural Language Processing

Human Pose Estimation with Deep Learning – Ultimate Overview in 2024

Viso.ai

DECEMBER 3, 2023

Pose estimation is a fundamental task in computer vision and artificial intelligence (AI) that involves detecting and tracking the position and orientation of human body parts in images or videos. provides the leading end-to-end Computer Vision Platform Viso Suite. Get a demo for your organization.

Deep Learning

Deep Learning Computer Vision Neural Network Robotics

Faster R-CNNs

PyImageSearch

NOVEMBER 13, 2023

The original Faster R-CNN paper used VGG (Simonyan and Zisserman, 2014) and ZF (Zeiler and Fergus, 2013) as the base networks. Do you think learning computer vision and deep learning has to be time-consuming, overwhelming, and complicated? Or requires a degree in computer science? Join me in computer vision mastery.

Convolutional Neural Networks

Convolutional Neural Networks Neural Network Computer Vision Deep Learning

Researchers from China Unveil ImageReward: A Groundbreaking Artificial Intelligence Approach to Optimizing Text-to-Image Models Using Human Preference Feedback

Marktechpost

OCTOBER 6, 2023

ImageReward aligns consistently with human preference ranking and exhibits superior distinguishability across models and samples compared to FID and CLIP scores on prompts from actual users and MS-COCO 2014. • For fine-tuning diffusion models concerning human preference scores, they suggest Reward Feedback Learning (ReFL).

Artificial Intelligence

Artificial Intelligence Artificial Intelligence Natural Language Processing NLP

Llama 4 family of models from Meta are now available in SageMaker JumpStart

AWS Machine Learning Blog

APRIL 7, 2025

You can use state-of-the-art model architecturessuch as language models, computer vision models, and morewithout having to build them from scratch. These pre-trained models serve as powerful starting points that can be deeply customized to address specific use cases. billion to a projected $574.78 billion in 2017 to a projected $37.68

Machine Learning

Machine Learning Large Language Models Python Automation

Google Research, 2022 & Beyond: Language, Vision and Generative Models

Google Research AI blog

JANUARY 18, 2023

I will begin with a discussion of language, computer vision, multi-modal models, and generative machine learning models. Over the next several weeks, we will discuss novel developments in research topics ranging from responsible AI to algorithms and computer systems to science, health and robotics. Let’s get started!

Computer Vision

Computer Vision Auto-classification Large Language Models Neural Network

Researchers from MIT and Adobe Introduce Distribution Matching Distillation (DMD): An Artificial Intelligence Method to Transform a Diffusion Model into a One-Step Image Generator

Marktechpost

DECEMBER 6, 2023

on MS-COCO 2014-30k using the same denoiser architecture as Stable Diffusion. Their one-step generator performs much better than known few-step diffusion methods on all benchmarks, including Consistency Models, Progressive Distillation, and Rectified Flow. DMD achieves FIDs of 2.62 on ImageNet, outperforming the Consistency Model by 2.4×.

Artificial Intelligence

Artificial Intelligence Artificial Intelligence Neural Network AI Research

ClimDetect: A New Benchmark Dataset for Testing AI Models in Detecting Climate Change Signals

Marktechpost

SEPTEMBER 14, 2024

It includes data from 28 climate models and 142 model runs, covering historical (1850-2014) and future scenarios (SSP2-4.5, ClimDetect is a dataset with 816,000 daily climate snapshots from the CMIP6 model ensemble, designed to enhance D&A studies of climate signals.

AI Modeling

AI Modeling Machine Learning Deep Learning AI

Foundational vision models and visual prompt engineering for autonomous driving applications

AWS Machine Learning Blog

NOVEMBER 15, 2023

In recent years, the field of computer vision has witnessed significant advancements in the area of image segmentation. We can also get the bounding boxes from smaller models, or in some cases, using standard computer vision tools. Sujitha Martin is an Applied Scientist in the Generative AI Innovation Center (GAIIC).

Prompt Engineering

Prompt Engineering Prompt Engineer Computer Vision Machine Learning

Top 5 Generative AI Integration Companies to drive Customer Support in 2023

Chatbots Life

MAY 16, 2023

Deeper Insights Year Founded : 2014 HQ : London, UK Team Size : 11–50 employees Clients : Smith and Nephew, Deloitte, Breast Cancer Now, IAC, Jones Lang-Lasalle, Revival Health. Services : AI Solution Development, ML Engineering, Data Science Consulting, NLP, AI Model Development, AI Strategic Consulting, Computer Vision.

Generative AI

Generative AI Chatbots Conversational AI Software Development

A Deep Dive into Variational Autoencoders with PyTorch

PyImageSearch

OCTOBER 2, 2023

Course information: 80 total classes • 105+ hours of on-demand code walkthrough videos • Last updated: September 2023 ★★★★★ 4.84 (128 Ratings) • 16,000+ Students Enrolled I strongly believe that if you had the right teacher you could master computer vision and deep learning. Or requires a degree in computer science?

Computer Vision

Computer Vision Deep Learning Neural Network Auto-complete

Pascal VOC Dataset: A Technical Deep Dive (2024 Guide)

Viso.ai

MAY 31, 2024

Pascal VOC is a renowned dataset and benchmark suite that has significantly contributed to the advancement of computer vision research. It provides standardized image data sets for object class recognition and a common set of tools for accessing the data and evaluating the performance of computer vision models.

Computer Vision

Computer Vision Convolutional Neural Networks Deep Learning Neural Network

Multi-Modal Methods: Image Captioning (From Translation to Attention)

ML Review

JUNE 4, 2018

Recent Intersections Between Computer Vision and Natural Language Processing (Part Two) This is the second instalment of our latest publication series looking at some of the intersections between Computer Vision (CV) and Natural Language Processing (NLP). 2014)[ 73 ] and Donahue et al.

Neural Network

Neural Network Convolutional Neural Networks Computer Vision Deep Learning

Scaling Diffusion transformers (DiT): An AI Framework for Optimizing Text-to-Image Models Across Compute Budgets

Marktechpost

OCTOBER 19, 2024

The study validates scaling laws on out-of-domain datasets using the COCO 2014 validation set. To validate these laws, they extrapolate to a 1.5e21 FLOPs budget, training a 958.3M parameter model that closely matches predicted loss.

Inference Engine

Inference Engine Large Language Models Artificial Intelligence Artificial Intelligence

2022H2 Amazon Textract launch summary

AWS Machine Learning Blog

DECEMBER 29, 2022

He has 20+ years of experience with internet-related technologies, engineering and architecting solutions and joined AWS in 2014, first guiding some of the largest AWS customers on most efficient and scalable use of AWS services and later focused on AI/ML with a focus on computer vision and at the moment is obsessed with extracting information from (..)

Automation

Automation Computer Vision ML Machine Learning

Keeping an eye on your cattle using AI technology

AWS Machine Learning Blog

OCTOBER 17, 2023

According to a 2014 study, the proportion of severely lame cows in China can be as high as 31 percent. He specializes in Computer Vision (CV) and Visual-Language Model (VLM). Tianjun Xiao is a senior applied scientist at the AWS AI Shanghai Lablet, co-leading the computer vision efforts.

Algorithm

Algorithm Computer Vision AI AI

Personalize your generative AI applications with Amazon SageMaker Feature Store

AWS Machine Learning Blog

OCTOBER 6, 2023

Next, we recommend “Interstellar” (2014), a thought-provoking and visually stunning film that delves into the mysteries of time and space. He is passionate about computer vision, NLP, generative AI, and MLOps. With its groundbreaking special effects and memorable characters, this movie is a must-see for any fan of the genre.

Generative AI

Generative AI LLM Natural Language Processing Metadata

Robustness of a Markov Blanket Discovery Approach to Adversarial Attack in Image Segmentation: An…

Mlearning.ai

MARCH 9, 2023

In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2012; Otsu, 1979; Long et al., 2013; Goodfellow et al., IEEE transactions on pattern analysis and machine intelligence, 33(5), 898–916. Goodfellow, I.

Deep Learning

Deep Learning Computer Vision Machine Learning Algorithm

Convolutional Neural Networks: A Deep Dive (2024)

Viso.ai

JANUARY 2, 2024

In the following, we will explore Convolutional Neural Networks (CNNs), a key element in computer vision and image processing. Viso Suite enables the use of neural networks for computer vision with no code. Le propose architectures that balance accuracy and computational efficiency. Learn more and request a demo.

Convolutional Neural Networks

Convolutional Neural Networks Neural Network Computer Vision Data Scarcity

AlphaPose: A Comprehensive Guide to Pose Estimation

Viso.ai

JUNE 19, 2024

AlphaPose is a multi-person pose estimation model that uses computer vision and deep learning techniques to detect and predict human poses from images and videos in real time. About us: Viso Suite provides full-scale features to rapidly build, deploy, and scale enterprise-grade computer vision applications.

Convolutional Neural Networks

Convolutional Neural Networks Computer Vision Neural Network Deep Learning

You are probably doing Medical Imaging AI the wrong way.

Mlearning.ai

JUNE 16, 2023

The ImageNet dataset, featuring natural images, contains 14,197,122 annotated images organized in 1000 classes and is commonly used as a benchmark for many computer vision models⁸. This “transfer learning” approach was so successful that it became the de-facto standard for solving a broad range of computer vision problems.

Convolutional Neural Networks

Convolutional Neural Networks Computer Vision AI AI

Explain text classification model predictions using Amazon SageMaker Clarify

AWS Machine Learning Blog

JANUARY 25, 2023

Apart from supporting explanations for tabular data, Clarify also supports explainability for both computer vision (CV) and natural language processing (NLP) using the same SHAP algorithm. It is constructed by selecting 14 non-overlapping classes from DBpedia 2014.

Explainability

Explainability Algorithm Natural Language Processing Machine Learning

Implement smart document search index with Amazon Textract and Amazon OpenSearch

AWS Machine Learning Blog

SEPTEMBER 8, 2023

He joined AWS in 2014, first guiding some of the largest AWS customers on the most efficient and scalable use of AWS services, and later focused on AI/ML with a focus on computer vision. He has over 20 years of experience with internet-related technologies, engineering, and architecting solutions.

IDP

IDP Automation Python ML

ML Days in Tashkent — Day 1: City Tour

PyImageSearch

DECEMBER 4, 2023

Built in 2014 along the Ankhor Canal, it’s fondly known as the “Snow Mosque” due to its pristine white marble construction. The keras and keras_cv imports are for using Keras and its computer vision extensions, respectively. Here, we are using the “torch” backend. Join the Newsletter!

ML

ML Machine Learning Computer Vision Python

StyleGAN Explained: Revolutionizing AI Image Generation

Viso.ai

JULY 9, 2024

StyleGAN is GAN (Generative Adversarial Network), a Deep Learning (DL) model, that has been around for some time, developed by a team of researchers including Ian Goodfellow in 2014. Since the development of GANs, the world saw several models introduced every year that got nearer to generating real images.

Explainability

Explainability Computer Vision Convolutional Neural Networks Neural Network

An Exploratory Look at Vector Embeddings

Mlearning.ai

JULY 31, 2023

2014; Bojanowski et al., Traditionally, Computer Vision tasks use several Convolutional layers to extract significant features by iterating over the image using a fixed-sized box (kernel). Instead, why not use a set of embeddings that are already trained? Sometimes, this can be easier and much faster. So, what’s the alternative?

Deep Learning

Deep Learning Computer Vision Algorithm ML

AI Distillery (Part 1): A bird’s eye view of AI research

ML Review

MARCH 5, 2019

It, of course, includes the work we have done manually in our previous two survey publications: A Year in Computer Vision and Multi-Modal Methods. Crafting a dataset The number of papers added to ArXiv per month since 2014. In 2018, over 1000 papers have been released on ArXiv per month in the above areas.

AI Researcher

AI Researcher AI Research Neural Network AI

The Magic of AI Art: Understanding Neural Style Transfer

Viso.ai

JULY 17, 2024

GANs based Models GANs were first introduced in 2014 and have been modified for use in various applications, style transfer being one of them. Enterprise AI Viso Suite infrastructure makes it possible for enterprises to integrate state-of-the-art computer vision systems into their everyday workflows.

Convolutional Neural Networks

Convolutional Neural Networks Neural Network Computer Vision Deep Learning

1x1 Convolution: Explainer

Mlearning.ai

AUGUST 4, 2023

In this blog, we will try to deep dive into the concept of 1x1 convolution operation which appeared in the paper ‘Network in Network’ by Lin et al in (2013) and ‘Going Deeper with Convolutions’ by Szegedy et al (2014) that proposed the GoogLeNet architecture.

Explainability

Explainability Neural Network Deep Learning AI

A Guide to Convolutional Neural Networks

Heartbeat

AUGUST 21, 2023

GoogLeNet: is a highly optimized CNN architecture developed by researchers at Google in 2014. This helps avoid disappearing gradients in very deep networks, allowing ResNet to attain cutting-edge performance on a wide range of computer vision applications.

Convolutional Neural Networks

Convolutional Neural Networks Neural Network Natural Language Processing Deep Learning

Neuromorphic Engineering: Developing Brain-Inspired Machines

Viso.ai

JUNE 26, 2024

This article will discuss the following: Neuromorphic Engineering and its core principles History and Development Algorithms Used How Neuromorphic Algorithms differ from Traditional Algorithms Real-world examples Applications and Use Cases About Us: At Viso.ai, we power Viso Suite, the most complete end-to-end computer vision platform.

Neural Network

Neural Network Algorithm Robotics Computer Vision

Philips accelerates development of AI-enabled healthcare solutions with an MLOps platform built on Amazon SageMaker

AWS Machine Learning Blog

NOVEMBER 16, 2023

Since 2014, the company has been offering customers its Philips HealthSuite Platform, which orchestrates dozens of AWS services that healthcare and life sciences companies use to improve patient care. This is a joint blog with AWS and Philips.

Data Scientist

Data Scientist ML Data Science Automation

AI Acquisitions: Who’s Leading the Charge and Why?

Allen Institute for Artificial Intelligence (AI2) Announces New CEO

Webinars

Trending Sources

Introduction of Neural Style Transfer – A Pioneer in Generative AI

Webinars

Computer Vision in Autonomous Vehicle Systems

Crack Detection in Concrete

Top Computer Vision Papers of All Time (Updated 2024)

Computer Vision Tasks (Comprehensive 2024 Guide)

Overview of Important GAN Models & Applications

Computer Vision for Cultural Heritage Preservation: Unlocking the Past with Advanced Imaging…

AI Emotion Recognition and Sentiment Analysis (2025)

Object Detection in 2024: The Definitive Guide

Active learning is the future of generative AI: Here’s how to leverage it

Digging Into Various Deep Learning Models

Human Pose Estimation with Deep Learning – Ultimate Overview in 2024

Faster R-CNNs

Researchers from China Unveil ImageReward: A Groundbreaking Artificial Intelligence Approach to Optimizing Text-to-Image Models Using Human Preference Feedback

Llama 4 family of models from Meta are now available in SageMaker JumpStart

Google Research, 2022 & Beyond: Language, Vision and Generative Models

Researchers from MIT and Adobe Introduce Distribution Matching Distillation (DMD): An Artificial Intelligence Method to Transform a Diffusion Model into a One-Step Image Generator

ClimDetect: A New Benchmark Dataset for Testing AI Models in Detecting Climate Change Signals

Foundational vision models and visual prompt engineering for autonomous driving applications

Top 5 Generative AI Integration Companies to drive Customer Support in 2023

A Deep Dive into Variational Autoencoders with PyTorch

Pascal VOC Dataset: A Technical Deep Dive (2024 Guide)

Multi-Modal Methods: Image Captioning (From Translation to Attention)

Scaling Diffusion transformers (DiT): An AI Framework for Optimizing Text-to-Image Models Across Compute Budgets

2022H2 Amazon Textract launch summary

Keeping an eye on your cattle using AI technology

Personalize your generative AI applications with Amazon SageMaker Feature Store

Robustness of a Markov Blanket Discovery Approach to Adversarial Attack in Image Segmentation: An…

Convolutional Neural Networks: A Deep Dive (2024)

AlphaPose: A Comprehensive Guide to Pose Estimation

You are probably doing Medical Imaging AI the wrong way.

Explain text classification model predictions using Amazon SageMaker Clarify

Implement smart document search index with Amazon Textract and Amazon OpenSearch

ML Days in Tashkent — Day 1: City Tour

StyleGAN Explained: Revolutionizing AI Image Generation

An Exploratory Look at Vector Embeddings

AI Distillery (Part 1): A bird’s eye view of AI research

The Magic of AI Art: Understanding Neural Style Transfer

1x1 Convolution: Explainer

A Guide to Convolutional Neural Networks

Neuromorphic Engineering: Developing Brain-Inspired Machines

Philips accelerates development of AI-enabled healthcare solutions with an MLOps platform built on Amazon SageMaker

Stay Connected