An Awesome List of Awesomes
This is a simple aggregation of all of the "Awesome --Topic name--" github repos I've found till date and I feel are important to get started in the corresponding Topic.
The topics are relevant to Data lifecycle, Machine Learning, Deep learning research and some distributed computing.
Note: Not all of these links are actively maintained but some of them may serve as good starting points.
There are multiple lists for certain topics which may or may not have common links, I have added them with a serial number under the topic in no particular order.
Topic wise ML and DL research
- Math
- Data augmentation
- Data Augmentation link 1
- Data Augmentation link 2
- Data Augmentation review
- Multitask learning
- Diffusion models
- Diffusion models link 1
- Diffusion models link 2
- Stable Diffusion
- Self supervised learning
- Self supervised learning link 1
- Self supervised learning link 2
- Semi supervised learning
- Weakly Supervised Learning
- Weak Supervision
- Weakly Supervised Image Segmentation link 1
- Weakly Supervised Image Segmentation link 2
- Learning with Label noise
- Adversarial ML/DL
- Adversarial ML
- Adversarial Examples for Deep learning
- Architecture Search
- Contrastive self supervised learning
- Zero shot learning
- Zero shot learning link 1
- Zero shot learning link 2
- One shot learning
- Few shot learning
- Few shot learning link 1
- Few shot learning link 2
- Siamese networks
- Image Classification
- Contrastive learning
- Visual transformers
- Transformers for vision
- Transformers in Medical Imaging
- Transformers
- Transformers link 1
- Treasure-of-Transformers
- Transformers link 2
- OpenSetRecognition list
- Incremental learning
- Meta Learning
- Meta learning link 1
- Meta learning link 2
- Deep learning uncertainty
- Semantic segmentation
- Instance Segmentation
- Box supervised Instance Segmentation
- Referring Segmentation
- Image Delineation
- Long tailed recognition/learning
- Long tailed recognition link 1
- Long tailed recognition link 2
- Long tailed recognition link 3
- Image matting
- Image inpainting
- Image Harmonization
- Face
- Face Recognition
- Face related resources
- Conformal predictions
- Scene understanding
- Panoptic Segmentation
- Panoptic Segmentation link 1
- Panoptic Segmentation link 2
- Object Tracking and Detection
- Object Detection link 1
- Object Detection link 2
- Object Tracking and Detection
- Object Detection papers
- YOLO Object detection
- Image Denoising
- Image Denoising link 1
- Image Denoising link 2
- Image Distortion correction
- Image Deblurring
- Continual Learning/Lifelong learning
- Multimodal learning
- Multimodal learning link 1
- Multimodal learning link 2
- Multimodal Conversational AI
- Active learning
- Active learning link 1
- Active learning link 2
- Deep Reinforcement Learning
- Deep Reinforcement Learning Link 1
- Deep Reinorcement Learning Link 2
- Knowledge Distillation
- Knowledge distillation link 1
- Knowledge distillation link 2
- Anomaly detection
- Local Global Descriptors
- Image Captioning
- Image Captioning link 1
- Image Captioning link 2
- Image to Image translation
- Image Registration
- Image Registration
- Point Cloud registration link 1
- Point Cloud registration link 2
- 2D-3D Matching
- Text To Speech
- Text to Image synthesis
- Text to Image link 1
- Text to Image link 2
- Deep HDR Image synthesis
- Speech recognition & synthesis
- Speaker Diarization
- Video Analysis
- Video analysis
- Video Action Recognition
- Video understanding
- Video Object Detection
- Video Text Retrieval
- Video Generation
- Pose Estimation
- Pose estimation link 1
- Pose estimation link 2
- Machine translation
- Visual Question Answering (VQA)
- Visual Question Answering link 1
- Visual Question Answering link 2
- Question Answering (QA)
- Question Answering link 1
- Question Answering link 2
- Question Answering link 3
- Vision and Language Pretrained models
- Vision-Language Navigation
- Explainable Graph Reasoning
- Knowledge Graph Question Answering
- Sentence embeddings
- Embedding Models
- Text Summarization
- Text Summarization link 1
- Text Summarization link 2
- Text Summarization link 3
- Optical Character Recognition (OCR)
- Optical Character Recognition (OCR) link 1
- Optical Character Recognition (OCR) link 2
- Document understanding
- Graph Neural Networks (GNN)
- Graph Neural Networks (GNN) link 1
- Graph Neural Networks (GNN) link 2
- Graph Neural Networks (GNN) link 3
- Expressive GNN
- Generative Adversarial Networks (GAN)
- Generative Adversarial Networks (GAN) link 1
- Generative Adversarial Networks (GAN) link 2
- GAN Training
- GAN Applications
- AI Art Image Synthesis
- Generative Modelling
- Optical Flow
- 360 vision
- Image Alignment and Stitching
- Energy based models
- Decision Trees
- Gradient Boosting Models
- XGBoost
- Gradient Boosting Machines (GBMs)
- Gradient Boosting
- Metric Learing
- Recommendation Systems
- Recommendation Systems link 1
- Recommendation Systems link 2
- Recommendation Systems link 3
- Point Cloud Analysis
- Pruning
- Neural Ordinary Differential Equations (ODE)
- Autonomous Vehicles (FSD)
- Autonomous Vehicles (FSD) link 1
- Autonomous Vehicles (FSD) link 2
- Robotics
- Curriculum Learning
- Causal Inference/ML
- Causal Inference/ML Link 1
- Causal Inference/ML Link 2
- Causal Inference/ML Link 3
- Causal Inference Visual Guides
- Satellite Image Deep Learning
- Transfer learning, Domain Adaption & Domain Generalization
- Transfer learning, Domain Adaption & Domain Generalization link 1
- Transfer learning, Domain Adaption & Domain Generalization link 2
- Transfer learning, Domain Adaption & Domain Generalization link 3
- Transfer learning, Domain Adaption & Domain Generalization link 4
- Domain Adaption - Object detection
- Image Restoration
- Variational Autoencoders
- Bayesian inference/Bayesian DL
- Bayesian inference/Bayesian DL link 1
- Bayesian inference/Bayesian DL link 2
- Deep Geometrical Learning
- Drug Discovery
- Representation Learning
- Disentangled Representations
- Disentangled Representations link 1
- Disentangled Representations link 2
- Time series
- Time series link 1
- Time series link 2
- Unsupervised/Weakly supervised learning
- Neural Rendering
- Neural Radiance Fields
- Neural Art & Style Transfer
- Neural Art
- Style Transfer
- Deepfakes
- Deepfakes link 1
- Deepfakes link 2
- Makeup transfer
- Audio/Music
- Inverse Graphics
- Model Quantization
- Model Quantization link 1
- Model Quantization link 2
- Quantum Machine Learning
- Game AI
- Game AI link 1
- Game AI link 2
- Biomedical ML
- Biomedical ML
- ML in Biomed Healthcare imaging
- Neuron segmentation in EM images
- Medical imaging
- Financial ML/Quantitative Finance
- Financial ML
- Quantitative Finance
- AI in Finance
- Embodied Vision
- Embedded and Mobile ML
- Mobile Machine Learning
- Embedded and Mobile ML
- 3D Machine Learning
- AutoML
- AutoML link 1
- AutoML link 2
- AutoML link 3
- Chatbot projects
- Chatbot projects link 1
- Chatbot projects link 2
- Conversational AI
- AI for climate change
- AI for climate change link 1
- AI for climate change link 2
- Federated Learning
- Federated Learning link 1
- Federated Learning link 2
- Federated Machine Learning
- Federated Computing
- Relational Extraction
- Attention Mechanisms in Computer Vision
- Attention Mechanism in Computer Vision
- Vision Attentions Survey
- Transformers in Computer Vision
- Masked Image Modelling
- Table Recognition
- Imbalanced Learning
- Music Generation
- 3D Generation
- Small Molecule - Drug Discovery ML
- Low light Image Enhancement
- DehazeZoo
- ML for cybersecurity
- Various Attention implementations in PyTorch
- Transformer Attention
- Radiology Report generation
- Vision and Language
- Machine Learning for Combinatorial Optimization
- Dataset Distillation/Condensation
- Virtual Try on
- Protein Design using DL
- Novel Class Discovery
- Music Informatics
- Discovery of Physical Laws
- CLIP (Contrastive Language-Image Pre-Training)
- Content Based Image Retrieval
- Content Based Image Retrieval Link 1
- Content Based Image Retrieval Link 2
- Topic Modeling
- Vector Search
- Clinical NLP
- Medical Coding NLP
- Content Moderation
- Pretrained Chinese NLP models
- Computer Vision in the Wild
- Mixture of Experts
- Reasoning in Large Language Models
- LLM Robotics
- Networks Beyond Attention
- Matching tasks
- Semantic Search
- Semantic Search link 1
- Semantic Search link 2
- GAN Inversion
- Dynamic Graph Learning
- Molecular Generation
- Failed ML
- Generative AI Applications
- ChatGPT
- Prompt Engineering
- Generative AI
- Conditional Content Generation
- Large Language Models (LLM)
- Large Language Models (LLM) link 1
- Large Language Models (LLM) link 2
- LLM Zoo
- LLM Practical Guide
- LLM Data hub
- LLM Robotics
- LangChain
- Reinforcement Learning with Human Feedback(RLHF)
- LLMOps
- Instruction Dataset
- Totally open ChatGPT
- 6D Object Pose Estimation and Reconstruction
- Awesome Anything
- Pretrained models for Information Retrieval
- AI Tools for Game development
- Human Video Generation
- Medical Vision-Language Models
- Video Diffusion
- Multimodal Large Language Models
Generic ML, DL, CV, NLP lists
- Data collection search engines
- Google Dataset Search
- Kaggle
- Harvard Dataverse
- Mendeley Data
- Academic Torrents
- Deep Learning Drizzle
- Deep Vision
- Deep Learning
- Machine Learning
- Computer vision
- Image processing applications with OpenCV
- DL papers
- DL paper reading roadmap
- ML Acronyms
- Applied Deep Learning
- NLP
- NLP link 1
- NLP link 2
- NLP link 3
- Jupyter
- Jupyter
- Naas.ai Jupyter Templates for data projects
- Jupyter lab
- GPT 3
- GPT 3 link 1
- GPT 3 link 2
- Made With ML
- Guide to Pytorch Learning Rate Scheduling
- PyTorch
- PyTorch link 1
- PyTorch link 2
- PyTorch link 3
- TensorFlow
- TensorFlow link 1
- TensorFlow link 2
- JAX
- 500 AI ML projects
- Tensorflow.js
- HuggingFace
- FastAI
- FastAI link 1
- FastAI link 2
- NLP recipes/Best practices
- Computer vision recipes/Best practices
- Time Series Forecasting - Best practices
- Access amd modify layes of pretrained models in PyTorch
- ML in Digital Signal Processing
- Paper and literature search websites, twitter accounts
- Papers With Code
- Crossminds
- DeepAI
- Arxiv Daily
- Math for ML
- ML Papers explained
- Transfer Learning
Data and ML lifecycle, ML Infra
- Data Collection
- Public APIs
- Robotics Datasets
- Medical Imaging Datasets
- Robotic Tooling
- Public Datasets
- Public Datasets link 1
- Public Datasets link 2
- Web Scraping
- Web Scraping link 1
- Web Scraping link 2
- Data Engineering
- Data Engineering link 1
- Data Engineering link 2
- Feature Engineering
- Feature Engineering link 1
- Feature Engineering link 2
- SQL
- SQL Link 1
- SQL Link 2
- SQL Link 3
- SQLAlchemy
- NoSQL
- NoSQL link 1
- NoSQL link 2
- DynamoDB
- MongoDB
- Firebase
- Firebase link 1
- Firebase link 2
- Redis
- Redis
- Redis optimizations
- Apache Spark
- BigQuery
- BigData
- BigData link 1
- BigData link 2
- Workflow Engines/Pipelines/DAG schedulers
- Workflow Engines
- Pipeline
- Apache Airflow
- Pandas
- Dask
- Graph Databases
- Data Annotation
- Data Labelling link 1
- Data Labelling link 2
- Data Labelling link 3
- Data Labelling link 4
- Data Labelling link 5
- Data Visualization
- Data Visualization
- Dash/Plotly
- Tools to Design or Visualize Architecture of Neural Network
- Power BI
- Power BI link 1
- Power BI link 2
- Network analysis
- Software Engineering for Machine Learning
- Scikit Learn
- Production level ML/DL
- Production level DL link 1
- Production level DL link 2
- Production level ML
- ML system design
- Applied ML
- ML Infrastructure
- MLOps
- MLOps link 1
- MLOps link 2
- MLOps link 3
- AIOps
- Explainable AI(XAI)
- Explainable AI (XAI) link 1
- Explainable AI (XAI) link 2
- ML Interpretability
- Fairness in AI
- Kaggle Solutions
- Ethical AI Guidelines
- SageMaker
- CoreML models
- Terraform
- DataBricks
- DataBricks link 1
- DataBricks official resources
- ML stack of blogs of companies
- Colab notebooks
- ML Libraries and tooling
Cool books for modern ML, DL applications
- ML Design Patterns
- Distributed ML Patterns
- Deep Learning Design Patterns
- PyTorch Computer Vision Cookbook - 70 recipes
System Design, Distributed systems and more
- System Design
- System Design link 1
- System Design link 2
- RESTful API
- Dev Ops Exercises
- Webhooks
- Webhooks link 1
- Webhooks link 2
- ASGI
- Flask
- Django
- Django link 1
- Django link 2
- Django link 3
- FastAPI
- Distributed systems
- Python
- Python link 1
- Python link 2
- F-ing awesome python
- GraphQL
- GraphQL link 1
- GraphQL link 2
- GraphQL link 3
- Scientific Python
- CUDA
- Parallel Computing
- Docker
- Docker Compose
- Kubernetes
- Kubernetes link 1
- Kubernetes link 2
- Kubernetes link 3
- K8s Security
- Ansible
- Ansible link 1
- Ansible link 2
- High Performance Computing (HPC)
- Microservices
- Microservices link 1
- Microservices link 2
- Domain Driven Design
- Low Level Design Primer
- System Design Primer
- Design Patterns
- Software Architecture
- Asyncio Python
- Knowledge graphs
- Knowledge graphs link 1
- Knowledge graphs link 2
- GRPC
- Peer-to-Peer
- RabbitMQ
- Kafka
- Kafka link 1
- Kafka link 2
- Kafka link 3
- Kafka link 4
- ML-tooling: Best of Python
- Amazon Web Services (AWS)
- Google Cloud Platform (GCP) Certifications
- Azure
- Digital Ocean (DO)
Other Awesome lists
Note: These lists are related to general concepts in Software architecture. They may or may not be completely related to Data lifecycle, ML and DL topics.
- Full Text Search
- MapReduce
- Build Your Own X
- Falsehoods
- CTO
- Raspberry Pi
- Creative Coding
- Site Reliability Engineering(SRE)
- Web Security
- Chaos Engineering
- ElasticSearch
- Database Internals
- Cryptography
- Tech Stacks
- Geographic Information System(GIS)
- GeoSpatial Tools
- IoT
- Quantum Computing
- Quantum Computing
- Quantum software
- Regex
- Regex link 1
- regex link 2
- CS Video courses
- ML-YouTube Courses
- AI Expert Roadmap
- Hacking
- Concepts and Laws
- CS books and Digests
- Self Hosted SaaS solutions
- Search
- UI
Contributing
Check CONTRIBUTING.md
TODOs