Old notes repo
AI
Machine learning
Applied ML advice
Reinforcement learning foundations
Optimization
Fine-tuning
Deep learning foundations
AI research
General deep learning research
Transformer research
Large language models (LLMs)
More LLMs
Multimodal vision models
Video vision models
Image generation
Diffusion (DDPM)
Image processing
Video generation
Audio generation
Speech generation
Reinforcement learning research
Building deep learning models
ML systems research and engineering
LLM scaling
LLM scaling algorithms (systems)
LLM scaling cases / data
Inference performance optimization
GPU computing
Megatron-DeepSpeed
gpt-neox
gpt-neox MoE design doc
Megablocks
Applied LLMs
PyTorch
More specific codebases
llama2.c
Sweep codebase
AI research questions
Small models / scaling down LLMs
Datasets
LLM evals
AI organizations
Reasoning
Non-LLM deep learning and software engineering
LLM applications
Agents
LLMs for coding
Codex
Sourcegraph Cody
axolotl
LLM code snippets
Dev scratch journal
Applied Stable Diffusion (classic)
Computer science and engineering
Algorithms
Algorithm problems
Dynamic programming problems
Systems
Database systems
Operating systems
Dataflow systems
Computer architecture
Programming languages
Compilers
Information retrieval
Sandboxing
Server sandboxing: https://yz.mit.edu/posts/server-sandboxing/
Browser sandboxing
Distributed systems
CRDTs
Operational transform
Editing DAGs concurrently
Concepts
Cache maintenance: https://yz.mit.edu/posts/cache-maintenance/
Commutativity
Vectorization
Concept relationships and bottom-up derivations
Math
Old math notes
Probability problems
Linear algebra
Importance sampling
Development
General development
SQL alternatives
Node.js
Python
Python package management
Web development
MIME, multipart, binary encodings
TypeScript
React patterns
Next.js
JS monorepos (Nx, Lerna, etc.)
JS package management, npm, yarn, pnpm
WordPress