Machine Intelligence - 2024

A chronological archive of Machine Intelligence reading list posts.

2024

December 2024


November 2024


October 2024


September 2024


August 2024


July 2024

🔗 Overcoming the limits of current LLM

🔗 AI achieves silver-medal standard solving International Mathematical Olympiad problems

🔗 Pop Culture

🔗 Will We Run Out of Data? Limits of LLM Scaling Based on Human-Generated Data

🔗 LLM101n: Let's build a Storyteller

🔗 Gentleness and the artificial Other

🔗 Introducing RouteLLM: A Cost-Effective LLM Router Framework

🔗 Investors Are Suddenly Getting Very Concerned That AI Isn't Making Any Serious Money

🔗 Gen AI: too much spend, too little benefit?

🔗 SITUATIONAL AWARENESS - The Decade Ahead

🔗 The VR winter continues

🔗 exo Project Repository

🔗 Mapping the landscape of gen-AI product user experience

🔗 Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

🔗 Language models on the command-line

🔗 The Engineer’s Guide To Deep Learning

🔗 Korvus: Unified Search SDK

🔗 OpenAI working on new reasoning technology under code name ‘Strawberry’

🔗 Does Refusal Training in LLMs Generalize to the Past Tense?

🔗 Garandor Securing Digital Identity and Copyrights

🔗 Do the Returns to Software R&D Point Towards a Singularity?

🔗 A Model of a Mind

🔗 Non-Obvious Prompt Engineering Guide

🔗 Yohana: The Ultimate Concierge for Busy Families

🔗 Purposefully Teaching Future Readiness with Alex Zarifeh

🔗 General Theory of Neural Networks

🔗 Who Wins the AI Value Chain?

🔗 Tammy Lovin · Sora Showcase

🔗 AI Scribes: Investment Thesis by a16z

🔗 The AI Summer

🔗 Introducing Eureka Labs

🔗 Kijai's LivePortrait


June 2024

🔗 Introducing Generative Physical AI -- youtube.com

🔗 What We Learned from a Year of Building with LLMs (Part I) (oreilly.com)

🔗 Transforming Customer Support and Sales with Mendable's AI Solutions

🔗 Why Apple is Taking a Small-Model Approach to Generative AI

🔗 Achieving the Self-Thinking Business (linkedin.com)

🔗 Will We Run Out of Data? Limits of LLM Scaling Based on Human-Generated Data

🔗 Back To Atoms

🔗 The Next Great Scientific Theory is Hiding Inside a Neural Network

🔗 Why we no longer use LangChain for building our AI agents

🔗 Here’s what’s really going on inside an LLM’s neural network (arstechnica.com)

🔗 My personal AI research agenda, mid 2024 (and a pitch for work)

🔗 What's the future for generative AI? The Turing Lectures with Mike Wooldridge (youtube.com)

🔗 Generative AI Handbook: A Roadmap for Learning Resources -- genai-handbook.github.io

🔗 What is the biggest challenge in our industry? (thrownewexception.com)

🔗 Grounding - Enhance GEN AI with YOUR DATA (youtube.com)

🔗 I Will Piledrive You If You Mention AI Again

🔗 If AI Can Do Your Job, Maybe It Can Also Replace Your CEO (nytimes.com)

🔗 Reverse Turing Test Experiment with AIs

🔗 The Future of AI

🔗 Scaling Monosemanticity - Extracting Interpretable Features from Claude 3 Sonnet (transformer-circuits.pub)

🔗 Sober AI is the Norm

🔗 Gen AI Testing and Evaluation with ARTKIT

🔗 Can LLMs invent better ways to train LLMs?

🔗 SWE-bench: Can Language Models Resolve Real-World GitHub Issues?


May 2024

🔗 Diffusion Models | Paper Explanation | Math Explained (youtube.com)

🔗 How LLMs Work, Explained Without Math (miguelgrinberg.com)

🔗 Daniel Dennett 'Where Am I?' (thereader.mitpress.mit.edu)

🔗 Introduction to gpt-4o (openai.com)

🔗 OpenAI Releases Its First AI-Made Music Video (techchilli.com)

🔗 Anthropic - Prompt Engineering (thenameless.net)

🔗 5 Big Myths Of AI and Machine Learning Debunked (splunk.com)

🔗 Refusal in LLMs is mediated by a single direction (lesswrong.com)

🔗 I Want Flexible Queries, Not RAG (win-vector.com)

🔗 Diffusion Models - Paper Explanation - Math Explained (youtube.com)

🔗 alessiodm-drl-zh - Deep Reinforcement Learning - Zero to Hero (github.com)

🔗 Ways to think about AGI (ben-evans.com)

🔗 GenAI in 2024 - Another Decade in One Year? (medium.com)

🔗 I will never go back - Ontario family doctor says new AI notetaking saved her job (globalnews.ca)

🔗 AlphaFold 3 predicts the structure and interactions of all of life’s molecules (blog.google)

🔗 John Carmack's 'Different Path' to Artificial General Intelligence (dallasinnovates.com)

🔗 Apple to unveil AI-enabled Safari with iOS 18 & macOS 15 (appleinsider.com)

🔗 State-of-the-Art Exact Binary Vector Search for RAG in 100 lines of Julia (domluna.com)

🔗 AI and problems of scale (ben-evans.com)

🔗 Ilya 30u30 AI papers for John Carmack (arc.net)

🔗 OpenAI to Challenge Google with Its Own Search Engine in May (beebom.com)

🔗 I'm Bearish OpenAI (stovetop.substack.com)

🔗 Malleable software in the age of LLMs (geoffreylitt.com)


April 2024

🔗 The Question No LLM Can Answer

🔗 Embeddings are a good starting point for the AI curious app developer

🔗 Generative AI is still a solution in search of a problem

🔗 Financial Market Applications of LLMs

🔗 What Computers Cannot Do: The Consequences of Turing-Completeness

🔗 Natural language instructions induce compositional generalization in networks of neurons

🔗 Adobe Is Buying Videos for $3 Per Minute to Build AI Model

🔗 How (Specifically) AI Will 100x Human Creativity and Output

🔗 GR00T: NVIDIA's moonshot to solve embedded AI

🔗 The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey

🔗 Outset.ai: Revolutionizing User Surveys with GPT-4

🔗 Bland.ai Turbo

🔗 OpenAPI AutoSpec

🔗 Dify

🔗 An Agentic Design for AI Consciousness

🔗 Emergent Mind

🔗 The Pipe

🔗 LLM in a flash: Efficient Large Language Model Inference with Limited Memory

🔗 CometLLM: Logging and Visualizing LLM Prompts

🔗 Making Deep Learning Go Brrrr From First Principles

🔗 RAFT: A new way to teach LLMs to be better at RAG

🔗 Anthropic's Prompt Engineering Interactive Tutorial

🔗 Thoughts on the Future of Software Development

🔗 The Death of the Big 4: AI-Enabled Services Are Opening a Whole New Market

🔗 Full Steam Ahead: The 2024 MAD (Machine Learning, AI & Data) Landscape

🔗 Comprehensive Guide to AI Tools and Resources

🔗 Accelerating AI Image Generation with MIT's Novel Framework

🔗 Here’s Proof You Can Train an AI Model Without Slurping Copyrighted Content

🔗 Awesome Code AI

🔗 Cheat Sheet: 5 prompt frameworks to level up your prompts

🔗 3Blue1Brown: Neural Networks

🔗 How to unleash the power of AI, with Ethan Mollick

🔗 Screen Recording to Code

🔗 MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

🔗 Replicate.com

🔗 OpenAI Create Batch

🔗 More Agents Is All You Need

🔗 Looking for AI Use Cases

🔗 What Is an AI Anyway? | Mustafa Suleyman | TED

🔗 LLM inference speed of light

🔗 This prompting technique is insanely useful

🔗 CoreNet: A Library for Training Deep Neural Networks

🔗 Transformer Math 101

🔗 VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time

🔗 Stanford AI Index Report 2024: An In-depth Analysis of AI's Current State

🔗 Hello OLMo: A truly open LLM

🔗 Enhancing GPT's Response Accuracy Through Embeddings-Based Search


March 2024

🔗 The Problem of Human Specialness in the Age of AI

🔗 ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

🔗 Top AIs still fail IQ tests

🔗 Gold-Medalist Coders Build an AI That Can Do Their Job for Them

🔗 AI startups require new strategies: This time it’s actually different

🔗 Introducing TripoSR: Fast 3D Object Generation from Single Images

🔗 Stable Diffusion 3: Research Paper

🔗 Project GR00T

🔗 The Era of Abstraction & New Creative Tensions

🔗 Autogenerating a Book Series From Three Years of iMessages

🔗 As Nvidia hits $2 trillion, billionaire Marc Rowan’s asset manager Apollo calls AI a ‘bubble’ worse than even the dotcom era

🔗 Yann Lecun: Meta AI, Open Source, Limits of LLMs, AGI & the Future of AI | Lex Fridman Podcast #416

🔗 noi

🔗 Meta is building a giant AI model to power its ‘entire video ecosystem,’ exec says

🔗 LLM Prompt Injection Worm

🔗 pg_vectorize: a VectorDB for Postgres

🔗 The Expanding Dark Forest and Generative AI

🔗 You can now train a 70b language model at home

🔗 Oxen.ai Blog

🔗 Building Meta’s GenAI Infrastructure

🔗 An Introduction to Knowledge Graphs

🔗 A generalist AI agent for 3D virtual environments

🔗 Large language models can do jaw-dropping things. But nobody knows exactly why

🔗 "AI, no ads please": 4 words to wipe out $1tn

🔗 Anthropic’s Claude 3 causes stir by seeming to realize when it was being tested

🔗 Defending LLMs against Jailbreaking Attacks via Backtranslation


February 2024

🔗 Klarna says its AI assistant does the work of 700 people after it laid off 700 people

🔗 Mamba Explained: The State Space Model taking on Transformers

🔗 AI Hallucinations: Fear Not - It’s A Solved Problem - Here’s How (With Examples!)

🔗 Romanes Lecture: ‘Godfather of AI’ speaks about the risks of artificial intelligence

🔗 OpenAI shocks the world yet again … Sora first look

🔗 Large Language Models: A Survey

🔗 Jeff Dean (Google): Exciting Trends in Machine Learning

🔗 Antagonistic AI

🔗 GPT4's system prompt was leaked

🔗 The AI bullshit singularity

🔗 GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis

🔗 SORA Video To Video Is Literally Mind Blowing - 12 HD Demos - Changes Industry Forever For Real

🔗 OWASP LLM AI Security and Governance Checklist v1 (pdf)

🔗 Sam Altman Seeks Trillions of Dollars to Reshape Business of Chips and AI

🔗 Cory Doctorow: What Kind of Bubble is AI?

🔗 GeneGPT

🔗 I analysed 5M freelancing jobs to see what jobs are being replaced by AI

🔗 AI is the average of the Internet: WPP, Don't become an AI North Korea

🔗 Spreadsheets are all you need - Understanding GPT2 and Transformers with Spreadsheets

🔗 Deep Learning Discovers Antibiotics

🔗 Ancient Herculaneum scroll piece revealed by AI

🔗 GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting

🔗 Sora is a data-driven physics engine

🔗 Introduction to Matryoshka Embedding Models

🔗 The Age of Average

🔗 AI assistance is leading to lower code quality, claim researchers

🔗 Chain-of-Thought Reasoning Without Prompting

🔗 Machine Learning Research at Apple

🔗 Beyond Self-Attention: How a Small Language Model Predicts the Next Token

🔗 How I'd Learn AI (If I Had to Start Over)

🔗 Demis Hassabis on Chatbots to AGI | Hard Fork EP 71

🔗 Is OpenAI the next challenger trying to take on Google Search?

🔗 Is AI Actually Useful?

🔗 FCC Makes AI-Generated Voices in Robocalls Illegal

🔗 SORA

🔗 Reduce AI Hallucinations with Retrieval Augmented Generation

🔗 EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation


January 2024

🔗 The 6 Types of Conversations with Generative AI

🔗 The Frame Problem

🔗 RAG Using Unstructured Data & Role of Knowledge Graphs (part 2/2)

🔗 Embeddable AI with Nitro

🔗 Attacks on machine learning models

🔗 State-of-the-art Code Generation with AlphaCodium – From Prompt Engineering to Flow Engineering

🔗 Self-Consuming Generative Models Go MAD

🔗 The chaos inside OpenAI – Sam Altman, Elon Musk, and existential risk explained | Karen Hao

🔗 Seven Failure Points When Engineering a Retrieval Augmented Generation System

🔗 crewAI

🔗 AI Won’t Kill Our Jobs, It Will Kill Our Job Descriptions - and Leave Us Better Off

🔗 How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs

🔗 GenAI could make KYC effectively useless

🔗 OpenAI Cookbook

🔗 Torvalds Speaks: Impact of Artificial Intelligence on Programming

🔗 AI or Ain't: Eliza

🔗 Machine Learning Engineering Open Book

🔗 The Random Transformer: Understand how transformers work by demystifying all the math behind them

🔗 Coca-Cola’s accidentally terrifying Christmas card AI image generator

🔗 State of AI & predictions for 2024

🔗 Machine Learning Video Library

🔗 Mathematical Introduction to Deep Learning (Arxiv, 2310.20360)

🔗 Centaurs and Cyborgs on the Jagged Frontier

🔗 Ten Noteworthy AI Research Papers of 2023

🔗 Pushing ChatGPT's Structured Data Support To Its Limits

🔗 The Impact of Reasoning Step Length on Large Language Models

🔗 Sean Moriarty - The Future of Large Language Models is Elixir

🔗 DocLLM: A layout-aware generative language model for multimodal document understanding

🔗 Andrei Kovalev's Midlibrary

🔗 Direct Preference Optimisation: Your Language Model is Secretly a Reward Model

🔗 Stanford’s mobile ALOHA robot learns from humans to cook, clean, do laundry

🔗 Meta releases prompt engineering guide to help beginners and pros prompt like experts

