Sat.Jun 25, 2022 - Fri.Jul 01, 2022

article thumbnail

Stemming vs Lemmatization in NLP: Must-Know Differences

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction In the field of Natural Language Processing i.e., NLP, Lemmatization and Stemming are Text Normalization techniques. These techniques are used to prepare words, text, and documents for further processing. Languages such as English, Hindi consists of several words which are often derived […].

NLP 337
article thumbnail

Why You Should Write Weekly 15-5s

Eugene Yan

15 minutes a week to document your work, increase visibility, and earn trust.

100
100
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Automating Model Risk Compliance: Model Monitoring

DataRobot Blog

Monitoring Modern Machine Learning (ML) Methods In Production. In our previous two posts, we discussed extensively how modelers are able to both develop and validate machine learning models while following the guidelines outlined by the Federal Reserve Board (FRB) in SR 11-7. Once the model is successfully validated internally, the organization is able to productionize the model and use it to make business decisions.

article thumbnail

The Track Record of Futurists Seems. Fine

Cold Takes

Click lower right to download or find on Apple Podcasts, Spotify, Stitcher, etc. I've argued that the development of advanced AI could make this the most important century for humanity. A common reaction to this idea is one laid out by Tyler Cowen here : "how good were past thinkers at predicting the future? Don’t just select on those who are famous because they got some big things right." This is a common reason people give for being skeptical about the most important centur

AI 52
article thumbnail

How To Get Promoted In Product Management

Speaker: John Mansour

If you're looking to advance your career in product management, there are more options than just climbing the management ladder. Join our upcoming webinar to learn about highly rewarding career paths that don't involve management responsibilities. We'll cover both career tracks and provide tips on how to position yourself for success in the one that's right for you.

article thumbnail

How to Become a Blockchain Developer?

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction Although blockchain is still in its infancy, the opportunities for developers to contribute are not just exciting but also many. Many businesses, including supply chains, automotive, and finance, have adopted blockchain, but it is not without problems. When a cryptocurrency, namely Bitcoin, […].

More Trending

article thumbnail

4 Business AI Predictions for 2022-2023

Kavita Ganesan

AI as a field, especially in the context of real-world applications, has been progressing at a rapid pace. This has been further accelerated by the onset of the COVID-19 pandemic. In fact, AI was found to be the most discussed technology in 2021. Having worked with numerous clients, big and small, in the integration of AI, here are 4 Business AI predictions in 2022 and beyond. #1 Many more “deployed” models In the recent past, businesses have had trouble operationalizing models and have not seen

article thumbnail

FIGS: Attaining XGBoost-level performance with the interpretability and speed of CART

BAIR

FIGS (Fast Interpretable Greedy-tree Sums): A method for building interpretable models by simultaneously growing an ensemble of decision trees in competition with one another. Recent machine-learning advances have led to increasingly complex predictive models, often at the cost of interpretability. We often need interpretability, particularly in high-stakes applications such as in clinical decision-making; interpretable models help with all kinds of things, such as identifying errors, leveraging

article thumbnail

Data Driven Culture: A Far-fetched Goal for Organizations

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction Creating a collaborative, data-driven culture is one of the most important goals of many modern organizations. A data-driven culture is when data is used to make decisions at every level of the organization. A data-driven culture is about replacing the gut feeling […].

article thumbnail

20 SQL Coding Interview Questions

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction SQL stands for Structured Query Language. It’s a programming language to interact/query and manage RDBMS (Relational Database Management Systems). SQL skills are highly preferred and required as it’s used by many organizations in a large variety of software applications.

article thumbnail

Navigating the Future: Generative AI, Application Analytics, and Data

Generative AI is upending the way product developers & end-users alike are interacting with data. Despite the potential of AI, many are left with questions about the future of product development: How will AI impact my business and contribute to its success? What can product managers and developers expect in the future with the widespread adoption of AI?

article thumbnail

Introduction to Memcached using Python

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction Memcached is a highly-performant distributed caching system. It is an in-memory key-value data store, which makes it a type of NoSQL database. Memcached is used by tech giants like Facebook, Twitter, Instagram, and Netflix. In my previous article, I explained Redis which […].

Python 323
article thumbnail

Custom Named Entity Recognition using spaCy v3

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction to Named Entity Recognition A named entity is a ‘real-world object’ that is assigned a name, for example, person, organization, or location. For more details, check my previous article on fine tune Bert for NER. All in all, NER can be summarized as […].

BERT 321
article thumbnail

Top 15 Important Data Science Interview Questions

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Source – Analytics India Magazine Introduction Job interviews can be scary if you are a fresher and especially if you are attending interviews on interdisciplinary roles like Data Science and Machine Learning. The tension, the doubt if you will get a yes or […]. The post Top 15 Important Data Science Interview Questions appeared first on Analytics Vidhya.

article thumbnail

Linear Algebra for Data Science With Python

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction Linear Algebra, a branch of mathematics, is very much useful in Data Science. We can mathematically operate on large amounts of data by using Linear Algebra. Most algorithms used in ML use Linear Algebra, especially matrices. As most of the data is […]. The post Linear Algebra for Data Science With Python appeared first on Analytics Vidhya.

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

A Guide to Exploratory Data Analysis Explained to a 13-year-old!

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction You might be wandering in the vast domain of AI, and may have come across the word Exploratory Data Analysis, or EDA for short. Well, what is it? Is it something important, if yes why? If you are looking for the answers […]. The post A Guide to Exploratory Data Analysis Explained to a 13-year-old!

article thumbnail

Understanding Transaction Management in SQL

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction to SQL A transaction is a set of operations carried out as a logical unit of work. It is a logical work unit that includes one or more SQL statements. A database system must ensure that transactions are properly executed even if […]. The post Understanding Transaction Management in SQL appeared first on Analytics Vidhya.

article thumbnail

Predicting the Toxicity of Comments Using Text Classification

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction We all check our email every day, possibly more than once. The majority of email service providers have the useful feature of automatically separating spam emails from other emails. This is an example of a common NLP problem called text classification. Text […]. The post Predicting the Toxicity of Comments Using Text Classification appeared first on Analytics Vidhya.

NLP 271
article thumbnail

Introduction to Tree Methods in MLIB

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction to MLIB Tree methods are one of the most efficient ways of handling both the classification and the regression problems. There are ample methods available to choose from like Decision Tree, Random Forest, and Gradient Boosting. In this article, we will use […]. The post Introduction to Tree Methods in MLIB appeared first on Analytics Vidhya.

article thumbnail

How to Improve Email Deliverability and Optimize Each Send

Learn how to optimize email deliverability and drive greater email ROI. What lands your email in the customer’s inbox? Understanding those factors, otherwise known as email deliverability, is critical to getting the most return on your campaign investments. But the “rules” around which factors land you in the spam folder aren’t always easy to keep up with.

article thumbnail

Build a Step-by-step Machine Learning Model Using R

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction to Machine Learning Model Machine learning is changing the approach of businesses in the world. Every company, large or small, aspires to find insight from the large amounts of data it stores and processes regularly. The desire to predict the future motivates […].

article thumbnail

Partitioning and Bucketing in Hive

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction Hive is a popular data warehouse built on top of Hadoop that is used by companies like Walmart, Tiktok, and AT&T. It is an important technology for data engineers to learn and master. It uses a declarative language called HQL, also known […]. The post Partitioning and Bucketing in Hive appeared first on Analytics Vidhya.

article thumbnail

Getting Started with Linux File System

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. This article will discuss the essential and valuable commands required for managing Files and Directories in Linux Operating System. Prerequisites: You must have a Linux Operating System and some basic knowledge of Linux Shell. Source – [link] What is Linux? Linux is the most […].

article thumbnail

Predictive Analytics of Donors in Crowd Funding Platforms

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction to Predictive Analytics DonorsChoose.org is an online charity platform where thousands of teachers may submit requests through the online portals for materials and particular equipment to ensure that all kids have equal educational chances. The project is based on a Kaggle Competition […].

article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

Applications of Data Science Tools in Biopharmaceutical Industry

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction Leading biopharmaceutical industries, start-ups, and scientists are integrating Machine Learning (ML) and Artificial Intelligence Learning (AIL) into R&D to analyze extensive large data & data sets, identify patterns, and generate algorithms to explain them. AIL is increasingly capable of predictive analytics and […].

article thumbnail

Everything About Apache Hive and its Advantages!

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. What is Apache Hive? Hive, founded by Facebook and later Apache, is a data storage system created for the purpose of analyzing structured data. Operating under an open-source data platform called Hadoop, Apache Hive is a software application released in 2010 (October). Introduced to […].

article thumbnail

The Technology of Automation and Artificial Intelligence

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction Automation and intelligence are two technologies that are widely used in modern systems, making them cutting-edge. Without the two technologies of automation and intelligence, a computer system or program will revert to outdated design and development concepts, which are currently obsolete and […].

article thumbnail

Introduction to Redis Using Python

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction Redis is a popular in-memory key-value data store, which is a type of NoSQL database. Redis is chiefly used as a cache database, but its application does not end there. You can find many articles explaining how Redis can be the all-in-one […]. The post Introduction to Redis Using Python appeared first on Analytics Vidhya.

Python 271
article thumbnail

How Embedded Analytics Gets You to Market Faster with a SAAS Offering

Start-ups & SMBs launching products quickly must bundle dashboards, reports, & self-service analytics into apps. Customers expect rapid value from your product (time-to-value), data security, and access to advanced capabilities. Traditional Business Intelligence (BI) tools can provide valuable data analysis capabilities, but they have a barrier to entry that can stop small and midsize businesses from capitalizing on them.

article thumbnail

Combating Data Inconsistencies with SQL

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Photo by Sebastian Herrmann on Unsplash Introduction As you know from my previous posts, I’ve started a mailing list to promote my blog. In the last post in this series, we designed a way to answer the question of what’s causing folks to […]. The post Combating Data Inconsistencies with SQL appeared first on Analytics Vidhya.

article thumbnail

Hands-on Beginner’s Guide to SQL

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction From the very beginning of my SQL journey, I have been fascinated by this database language. The ability of SQL to be able to update, modify, delete and store data with very few lines of code is the most engaging part of it. […]. The post Hands-on Beginner’s Guide to SQL appeared first on Analytics Vidhya.

article thumbnail

Movie Recommendation with SQL Using Google Cloud Platform

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction Ever Wondered how Netflix gets to know your choice and shows the movie of your interest? Or have you ever think how Amazon shows your the recommended product based on your search item? So the magic behind these technologies is called the […]. The post Movie Recommendation with SQL Using Google Cloud Platform appeared first on Analytics Vidhya.