Applied Artificial Intelligence and Machine Learning
Introduction
Applied AI is a curated knowledge repository of artificial intelligence and machine learning use cases, best practices, lessons learned, tools, and techniques adopted by leading technology and tech-savvy organizations.
Many organizations share the best practices, tools, and techniques that shape their engineering culture through public channels such as engineering blogs, conferences, and meetups. This repository compiles content gathered from these sources.
Topics
- Artificial Intelligence
- Machine Learning
- Deep Learning
- Generative AI
- Large Language Models
- Prompt Engineering
- Retrieval-Augmented Generation (RAG)
- MLOps
- Data Engineering
- Responsible AI
Organizations
Airbnb
Blog Posts
2024
- Airbnb’s AI-powered photo tour using Vision Transformer
- Automation Platform v2: Improving Conversational AI at Airbnb
- Transforming Location Retrieval at Airbnb: A Journey from Heuristics to Reinforcement Learning
- Chronon, Airbnb’s ML Feature Platform, Is Now Open Source
- Airbnb Brandometer: Powering Brand Perception Measurement on Social Media Data with AI
2023
- Prioritizing Home Attributes Based on Guest Interest
- Learning To Rank Diversely
- Building Airbnb Categories with ML & Human in the Loop
- Wisdom of Unstructured Data: Building Airbnb’s Listing Knowledge from Big Text Data
2022
- How AI Text Generation Models Are Reshaping Customer Support at Airbnb
- Intelligent Automation Platform: Empowering Conversational AI and Beyond at Airbnb
2021
- Automating Data Protection at Scale, Part 2
2020
- Improving Deep Learning for Ranking Stays at Airbnb
Algolia
Blog Posts
2024
- Introducing AI Personalization
- How large-language models are changing ecommerce
- What does it mean for AI search to “understand” customers?
- The future of B2B is powered by AI. Here’s how to get started.
- The definitive guide to semantic search engines
2023
- Feature Spotlight: Query Suggestions
Altera.ai
Blog Posts
2024
- Altera uses GPT-4o to build a new area of human collaboration
Asana
Blog Posts
2024
- Asana supercharges work management with Claude
ASOS
Blog Posts
2024
- ASOS Tech GenAI Hack: Revolutionising requirements with GenAI
- Transforming Recommendations at ASOS
Autotrader
Blog Posts
2024
- Demystifying Large Language Models (LLM101)
- To App or Not to App (Rapid Data App Prototyping in the ML Experimentation Cycle)
- So many labels, so little time; accelerating our image labelling process
2023
- Demonstrating the Value of our Packages
2022
- Real-Time Personalisation of Search Results with Auto Trader's Customer Data Platform
BlaBlaCar
Blog Posts
2023
- How we used machine learning to fight fraud at BlaBlaCar — Part 1
- How we built our machine learning pipeline to fight fraud at BlaBlaCar — Part 2
- How BlaBlaCar leverages machine learning to match passengers and drivers - Part 2
- How BlaBlaCar leverages machine learning to match passengers and drivers - Part 1
Brian Impact Foundation
Blog Posts
2024
- Brian Impact Foundation powers their search for the next generation of social innovators with Claude
Canva
Blog Posts
2024
- Canva’s AI-powered Magic Studio used 5 billion times and counting.
2023
- Summarizing Post Incident Reviews with GPT-4
Coinbase
Blog Posts
2024
- Coinbase enhances customer support and operational efficiency with Claude
Dropbox
Blog Posts
2024
- Bye Bye Bye...: Evolution of repeated token attacks on ChatGPT models
- Bringing AI-powered answers and summaries to file previews on the web
- From AI to sustainability, why our latest data centers use 400G networking
2023
- Putting everything in its right place with ML-powered file organization
- Is this a date? Using ML to identify date formats in file names
- Don’t you (forget NLP): Prompt injection with control characters in ChatGPT
- Accelerating our A/B experiments with machine learning
2021
- Optimizing payments with machine learning
- How image search works at Dropbox
- Cannes: How ML saves us $1.7M a year on document previews
eBay
Blog Posts
2024
- Background Enhancement Tool Turns Any Photo Into a Studio-Quality Product Image
- eBay's Responsible AI Principles
- Cutting Through the Noise: Three Things We've Learned About Generative AI and Developer Productivity
- Podcast: Nitzan Mekel-Bobrov on AI, the Future of Shopping Online, and the Value of Building In-House
2023
- New Social Caption Generator Uses AI to Help Sellers Post More Easily
- eBay Exec on How Artificial Intelligence Will Bring a ‘Paradigm Shift’ to Ecommerce
- ‘Magical’ Listing Tool Harnesses the Power of AI to Make Selling on eBay Faster, Easier, and More Accurate
- Evolving Recommendations: A Personalized User-Based Ranking Model
- Beyond Words: How Multimodal Embeddings Elevate eBay's Product Recommendations
- eBay Execs Talk Generative AI and Computer Vision at VentureBeat Transform Conference
- eBay’s Blazingly Fast Billion-Scale Vector Similarity Engine
- How eBay Created a Language Model With Three Billion Item Titles
2022
- Sherlock.io: An Upgraded Machine Learning Monitoring System
- Improving Shopping Recommendations for Customers Through eBay’s Relevance Cascade Model
- Building a Deep Learning Based Retrieval System for Personalized Recommendations
Estee Lauder
Blog Posts
2024
- Data-driven beauty: How The Estée Lauder Companies unlocks insights with ChatGPT
Etsy
Blog Posts
2024
- Efficient Visual Representation Learning And Evaluation
2023
- How We Built a Multi-Task Canonical Ranker for Recommendations at Etsy
- Leveraging Real-Time User Actions to Personalize Etsy Ads - Research Paper
- From Image Classification to Multitask Modeling: Building Etsy’s Search by Image Feature
- Leveraging Real-Time User Actions to Personalize Etsy Ads
2022
- Deep Learning for Search Ranking at Etsy
Expedia
Blog Posts
2024
- Learning Embeddings for Lodging Travel Concepts
- Traveling Just Got a Lot Smarter with Romie
- Choosing the Right Candidates for Lodging Ranking
2023
- Using Synthetic Search Data for Flights Price Forecasting
- Expedia Group’s Customer Lifetime Value Prediction Model
- Generating Diverse Travel Recommendations
- Increasing Travelers’ Engagement Through Price Alerts
- Candidate Generation Using a Two Tower Approach With Expedia Group Traveler Data
2022
- Categorising Customer Feedback Using Unsupervised Learning
- How to Optimise Rankings with Cascade Bandits
2021
- Personalized Ranking Model for Lodging
GitHub
Blog Posts
2024
- Unlocking the power of unstructured data with RAG
- What is retrieval-augmented generation, and what does it do for generative AI?
- Hard and soft skills for developers coding in the age of AI
- How AI code generation works
- Fixing security vulnerabilities with AI
- A developer’s second brain: Reducing complexity through partnership with AI
2023
- How we’re experimenting with LLMs to evolve GitHub Copilot
- The architecture of today’s LLM applications
- Demystifying LLMs: How they can do things they weren’t trained to do
- How to build an enterprise LLM application: Lessons from GitHub Copilot
- A developer’s guide to prompt engineering and LLMs
- Inside GitHub: Working with the LLMs behind GitHub Copilot
- How companies are boosting productivity with generative AI
- How generative AI is changing the way developers work
- Generative AI-enabled compliance for software development
- What developers need to know about generative AI
2022
- Exciting new GitHub features powering machine learning
GitLab
Blog Posts
2024
- GitLab enhances productivity with Claude
- Building GitLab with GitLab: A multi-region service to deliver AI features
GoDaddy
Blog Posts
2024
- Generative AI Domain Search
- From Text to Transformers: How LLMs Are Enhancing GoDaddy’s CMS Experience
- LLM From the Trenches: 10 Lessons Learned Operationalizing Models at GoDaddy
- Everything I’ve Learned Building an AI App
Grab
Blog Posts
2024
- LLM-assisted vector similarity search
- Leveraging RAG-powered LLMs for Analytical Tasks
- Evolution of Catwalk: Model serving platform at Grab
- Enabling conversational data discovery with LLMs at Grab
- Unveiling the process: The creation of our powerful campaign builder
- LLM-powered data classification for data entities at scale
Gumroad
Blog Posts
2024
- Gumroad’s customer support team ships code with Claude
Instacart
Blog Posts
2024
- Enhancing FoodStorm with AI Image Generation
- Distinguished Speaker Series with Ping Li: ML-Enhanced Sparse Vector Search with Privacy Protection
- Unveiling the Core of Instacart’s Griffin 2.0: A Deep Dive Into the Model Serving Platform
- Unlocking Efficiency: How Ava Became Our AI Productivity Partner
2023
- One model to serve them all
- Monte Carlo, Puppetry and Laughter: The Unexpected Joys of Prompt Engineering
- Unveiling the Core of Instacart’s Griffin 2.0: A Deep Dive into the Machine Learning Training Platform
- Introducing Griffin 2.0: Instacart’s Next-Gen ML Platform
- Scaling Productivity with Ava — Instacart’s Internal AI Assistant
- Supercharging ML/AI Foundations at Instacart
- Adopting dbt as the Data Transformation Tool at Instacart
- The Next Era of Data at Instacart
- How Instacart Modernized the Prediction of Real Time Availability for Hundreds of Millions of Items While Saving Costs
LinkedIn
Blog Posts
2024
- Enhancing LinkedIn’s security posture management with AI-driven insights
- Under the hood: the tech behind the first agent from LinkedIn, Hiring Assistant
Mercado Libre
Blog Posts
2024
- Mercado Libre introduces Verdi, an AI developer platform powered by GPT-4o
- Accelerating the AI-Driven Future with Data Governance at the Wheel
- Beyond the Hype: Real-World Lessons and Insights from Working with Large Language Models
Mercari
Blog Posts
2024
- Fine-Tuning an LLM to Extract Dynamically Specified Attributes
- LLM-based Approach to Large-scale Item Category Classification
2023
- LM-based query categorization for query understanding
- Leveraging LLMs in Production: Looking Back, Going Forward
- The Bitter Lesson about Engineers in a ChatGPT World
- Putting the Voice of Customers into the Software Development Process
- Mercari’s Journey Integrating AI & Search at Berlin Buzzwords 2023
- Improving Item Recommendation Accuracy Using Collaborative Filtering and Vector Search Engine
- Model management for client side ML powered by Firebase
- The Journey to Machine-Learned Re-ranking
- Do We Need Engineers in a ChatGPT World?
Meta
Blog Posts
Meta AI Blog and Research Papers
- Various AI/ML research papers submitted by Meta are available here
- Meta AI blog
2024
- Leveraging AI for efficient incident response
- Maintaining large-scale AI capacity at Meta
- Our next-generation Meta Training and Inference Accelerator
- PVF: A novel metric for understanding AI systems’ vulnerability against SDCs in model parameters
- Optimizing RTC bandwidth estimation with machine learning
- Logarithm: A logging engine for AI training workflows and services
- Building Meta’s GenAI Infrastructure
- Improving machine learning iteration speed with faster application build and packaging
- Lazy is the new fast: How Lazy Imports and Cinder accelerate machine learning at Meta
- How Meta is advancing GenAI
- Serverless Jupyter Notebooks at Meta
- Building custom silicon for the future of AI
2023
- Introducing Code Llama, a state-of-the-art large language model for coding
- Watch: Meta’s engineers on building network infrastructure for AI
- How Meta is creating custom silicon for AI
- AI debugging at Meta with HawkEye
- Arcadia: An end-to-end AI system performance simulator
- MTIA v1: Meta’s first-generation AI inference accelerator
2022
- Scaling data ingestion for machine learning training at Meta
2021
- Fully Sharded Data Parallel: faster AI training with fewer GPUs
- Asicmon: A platform agnostic observability system for AI accelerators
- How machine learning powers Facebook’s News Feed ranking algorithm
Microsoft
Blog Posts
2024
Morgan Stanley
Blog Posts and Sites
2024
- Morgan Stanley wealth management deploys GPT-4 to organize its vast knowledge base
- Machine Learning Research
Notion
Blog Posts
2024
- Notion creates more intelligent workspaces with Claude
- AI is the new plastic
- Meet the new Notion AI. Get to know what it can do for you.
Nubank
Blog Posts
2024
- AI at Nubank: how one of the largest digital banks in the world uses artificial intelligence
Scribd
Blog Posts
2024
- Scribd, Inc. boosts content discovery and engagement with Claude
Slack
Blog Posts
2024
- Empowering Engineers with AI
- Balancing Old Tricks with New Feats: AI-Powered Conversion From Enzyme to React Testing Library at Slack
- How We Built Slack AI To Be Secure and Private
Slalom Build
Blog Posts
2024
- The Secret to Success in Large-Scale Data Engineering Projects
- Generative AI Prompt Engineering: A Balancing Act
- RAG for Quality Engineers
- Feeding AI Models with Slack Data
- Keeping the Human in the Loop
2023
- Elevating Conversational AI with Dynamic Data and APIs via OpenAI Function Calling
- Behind the Large Language Models: Word Embedding
Sourcegraph
Blog Posts
2024
- Sourcegraph enhances the intelligence and speed of their AI-powered coding assistant with Claude
Uber
Blog Posts
2024
- Open Source and In-House: How Uber Optimizes LLM Training
- Genie: Uber’s Gen AI On-Call Copilot
- QueryGPT – Natural Language to SQL Using Generative AI
- Personalized Marketing at Scale: Uber’s Out-of-App Recommendation System
- DataK9: Auto-categorizing an exabyte of data at field level through AI/ML
- From Predictive to Generative – How Michelangelo Accelerates Uber’s AI Journey
- DragonCrawl: Generative AI for High-Quality Mobile Testing
- Scaling AI/ML Infrastructure at Uber
- Stopping Uber Fraudsters Through Risk Challenges
- Model Excellence Scores: A Framework for Enhancing the Quality of Machine Learning Systems at Scale
2023
- The Transformative Power of Generative AI in Software Development: Lessons from Uber’s Tech-Wide Hackathon
- Innovative Recommendation Applications Using Two Tower Embeddings at Uber
- Demand and ETR Forecasting at Airports
- Risk Entity Watch – Using Anomaly Detection to Fight Fraud
- Accelerating Advertising Optimization: Unleashing the Power of Ads Simulation
- uVitals – An Anomaly Detection & Alerting System
2022
- Project RADAR: Intelligent Early Fraud Detection System with Humans in the Loop
- DeepETA: How Uber Predicts Arrival Times Using Deep Learning
- Uber’s Real-Time Document Check
- How Uber Optimizes the Timing of Push Notifications using ML and Linear Programming
- ML Education at Uber: Program Design and Outcomes
- ML Education at Uber: Frameworks Inspired by Engineering Principles
2021
- Applying Machine Learning in Internal Audit with Sparsely Labeled Data
Walmart
Blog Posts
2024
- Managing Secure API Access to LLMs in Distributed Systems with Dataflow
- Creating Web App For File Interactions Using RAG: A Developers Guide
- Exploring the World of Vector Databases: A Comprehensive Guide
- Using Predictive and Gen AI to Improve Product Categorization at Walmart
- AI-Driven Continuous Monitoring: The Future of Third-Party Risk Management
- Build your own GPT (BYO-GPT)
- Textual Titans: A Large Language Model Odyssey
- Evaluation of RAG Metrics using RAGA
- Deploying RAGs in production — Part 2
- Deploying RAGs in production — Part 1
- Extracting Product Attributes from PDFs using PAE Framework
- Augmentation Techniques for Imbalanced text Classification
- Transforming Text Classification with Semantic Search Techniques — Faiss
2023
- Machine Learning Platform at Walmart
- Training Large Language Model (LLM) on your data
2022
- Saving and Retrieving ML Models Using PySpark in Cloud Platform
Wix
Blog Posts
2024
- Wix streamlines website content creation with GPT.
- Real-World Forecasting with Deep Learning: How We Do It at Wix
- Customizing LLMs for Enterprise Data Using Domain Adaptation: The Wix Journey
- AI for Revolutionizing Customer Care Routing System at Wix
- SageMaker Batch Transform Unleashed: My Journey at Wix to Achieve Scalable ML
- Beyond Content Generation: AI-Based Layout Generation for Graphic Design
Other Resources
Case Studies from vendors
The following resources from AI vendors such as OpenAI, Anthropic, and Databricks show how businesses are using AI to solve complex challenges and drive innovation.
- OpenAI
- Anthropic
- Databricks
ML and LLM System Design Catalog
- ML and LLM system design: 500 case studies to learn from
Contributors
Contribute
Contributions welcome! Read the contribution guidelines first.
Stargazers Over Time
License
To the extent possible under law, Unmesh Gundecha has waived all copyright and
related or neighboring rights to this work.
If you decide to use this anywhere, please credit @upgundecha on X. Also, if you like my work, check out my other projects on GitHub.