synesis
Chain-of-Thought, by Way of 4chan
AI
LLMs
links
Apple
reasoning
history
“Verified” Doesn’t Mean “Nothing Can Go Wrong”
AI
security
links
Claude
Lean
formal verification
software engineering
CoachGPT, in Practice
AI
LLMs
running
personal
generative AI
future of work
links
Why LLMs Still Stumble Over Time
AI
LLMs
reasoning
temporal reasoning
commonsense
evaluation
research
iPhone, Artemis II, Moon
Apple
space
links
iPhone
Artemis II
moon
history
Filesystems vs. RAG
AI
AI engineering
agentic systems
RAG
LLMs
generative AI
links
The Revenge of the Data Scientist
AI
AI engineering
agentic systems
LLMs
generative AI
hallucination
evaluation
links
Artemis II Launches on Apple’s 50th Anniversary
Apple
space
history
Artemis II
moon
Apollo
links
Learning to Reason in 13 Parameters
AI
LLMs
reasoning
generative AI
fine-tuning
links
Snowflake Cortex AI Escapes Sandbox and Executes Malware
AI
agentic systems
security
AI safety
prompt injection
links
Language Puzzles, NACLO, and a Note of Thanks
linguistics
AI
NACLO
computational linguistics
education
personal
links
Journey into Coding with AI [3/4]: Decision-Bound Programming
AI
coding
software engineering
AI engineering
journey series
Knuth’s Hamiltonian Cycles, Solved by Claude
AI
mathematics
Mathematica
Claude
generative AI
links
Woodinville Half: HM #76, the Comeback
running
half marathon
video
personal
race
The ‘CS Exodus’ as Discipline Evolution
AI
computer science
education
research
history
links
Anthropic on AI Coding: Scaffold, Not Substitute
AI
coding
software engineering
developer productivity
learning
Anthropic
links
Humans Need Rest, Even When Machines Don’t
AI
coding
software engineering
future of work
burnout
links
Inside the M5: As Big as New Jersey
Apple
semiconductor
iPhone
video
links
Bridle Trails 50K: Hot Cocoa, Sprained Ankle, No Horseshit
running
50K
ultra
trail running
video
personal
race
Parkrun Turkey Trot: 22:09 on Thanksgiving
running
Parkrun
5K
personal
race
“Skills” Are Not Software Engineering
AI
LLMs
agentic systems
software engineering
AI engineering
Claude
Anthropic
9th 50K Long Run: Redmond Library to the Seattle Office
running
ultra
long run
video
personal
Vibe Engineering
AI
agentic systems
software engineering
coding
AI engineering
links
A Pure Republic of Classical Music
Apple
Apple Music
classical music
music
AI
generative AI
links
Journey into Coding with AI [2/4]: Shifting Gears
AI
coding
software engineering
AI engineering
automation
journey series
Journey into Coding with AI [1/4]: Running Back to Code
AI
coding
software engineering
AI engineering
journey series
Embedding Limits: A Linear-Algebra Note (and Kernel Tricks)
AI
LLMs
RAG
embeddings
retrieval
RecSys
search
paper
research
Redmond Harvest Half: HM #64, 1:40:45
running
half marathon
personal
race
Embeddings Hit a Theoretical Ceiling
AI
LLMs
RAG
embeddings
retrieval
RecSys
search
agentic systems
paper
links
State of AI in Business 2025: Why 95% Get Zero P&L
AI
agentic systems
paper
future of work
links
Mid-Year Running Recap: PRs at Every Distance
running
half marathon
marathon
Parkrun
video
personal
MSR: Which Occupations GenAI Is Actually Used In
AI
generative AI
future of work
jobs
paper
links
How Anthropic Teams Use Claude Code
AI
coding
software engineering
agentic systems
Anthropic
Claude
generative AI
links
20 Years of Podcasts on iTunes
Apple
podcasting
history
links
For Alfred Brendel
music
classical music
Apple Music
personal
Karpathy: Software Is Changing (Again)
AI
AGI
agentic systems
talk
video
generative AI
links
ICR² Accepted to ACL 2025 Findings
AI
LLMs
RAG
retrieval
NLP
research
conference
paper
Mill Town Marathon: 3:24:01, Closer to BQ
running
marathon
personal
race
World’s Fastest 10K + Three PRs in Two Weeks
running
10K
5K
half marathon
Parkrun
personal
race
Apple Music Classical on the Web
Apple
Apple Music
classical music
music
links
Rancho San Antonio Trail Run, Cupertino
running
trail running
video
personal
DeepSeek vs ChatGPT on Ethical Questions
AI
ChatGPT
DeepSeek
generative AI
ethics
links
End of 2024: 2,375 Miles, 18 Marathons, 4 50Ks
running
marathon
50K
ultra
year in review
video
personal
Sutskever at NeurIPS 2024: Pre-Training Era Is Over
AI
generative AI
LLMs
NLP
reasoning
conference
video
links
Seattle Marathon 2024: 18th Marathon, GAP PR
running
marathon
half marathon
Parkrun
5K
personal
race
Parkrun PR + 34-Mile Ultra
running
Parkrun
5K
ultra
long run
video
personal
race
20 Seconds of Thinking, 100,000× More Data
AI
generative AI
LLMs
reasoning
OpenAI
links
How Well Can Transformers Build World Models?
AI
LLMs
world models
reasoning
transformers
paper
research
6th Parkrun PR + 2024 Stretch Goals Done
running
Parkrun
5K
marathon
half marathon
personal
race
Crossing Lake Washington: A 33.01-Mile Run
running
ultra
50K
long run
video
personal
Time-Sensitive Knowledge Editing via Efficient Fine-Tuning (ACL 2024)
AI
LLMs
NLP
knowledge editing
fine-tuning
conference
paper
research
10K for 7 Days at NAACL 2024 (Mexico City)
running
conference
NAACL
NLP
video
personal
TWEAK at NAACL 2024: Decoding Without Hallucinations
AI
LLMs
NLP
hallucination
knowledge graphs
conference
paper
research
generative AI
Are Researchers Using LLMs to Write Their Papers?
AI
LLMs
NLP
research
paper
generative AI
AI on Trial: Legal Models Hallucinate in 1 out of 6 Queries
AI
LLMs
NLP
hallucination
RAG
paper
generative AI
links
Two K2T Papers Accepted: TWEAK at NAACL, LAGRANGE at LREC-COLING (2024)
AI
LLMs
NLP
knowledge graphs
hallucination
generative AI
conference
paper
research
reasoning
Generative AI Seeped into Research Peer Reviews
AI
LLMs
NLP
research
paper
generative AI
links
St Patrick’s Day at Parkrun
running
Parkrun
personal
race
KnowledgeableLMs Workshop @ ACL 2024 — CFP
NLP
LLMs
knowledge graphs
RAG
research
conference
links
RAG with Knowledge Graphs: Key Open Questions
AI
LLMs
NLP
RAG
knowledge graphs
generative AI
links
First Parkrun of 2024 + Year’s Goals
running
Parkrun
marathon
half marathon
personal
race
ChatGPT Bombs Test on Diagnosing Kids’ Medical Cases
AI
LLMs
NLP
hallucination
generative AI
links
Learning from Tragedies: NLP Beyond LLMs
AI
LLMs
NLP
research
paper
evaluation
links
“Hallucinate”: Cambridge Dictionary’s 2023 Word of the Year
generative AI
LLMs
NLP
hallucination
links
Large Language Models as Sleuths
AI
LLMs
NLP
security
research
paper
generative AI
links
FLEEK: Fact Verification with LLMs and Knowledge Graphs
AI
LLMs
NLP
knowledge graphs
hallucination
conference
paper
generative AI
links
Neuroscience for Machine Learners
neuroscience
machine learning
education
links
Catching a Lying LLM
AI
LLMs
NLP
hallucination
knowledge graphs
paper
generative AI
links
From ‘Reversal Curse’ to Teaching Large Language Models New Facts
AI
LLMs
NLP
knowledge graphs
fine-tuning
paper
generative AI
links
LAGRANGE: Cyclic Evaluation for KG-Text Datasets
AI
LLMs
NLP
knowledge graphs
evaluation
paper
generative AI
Thinking in a Foreign Language Improves Decision-Making
AI
LLMs
NLP
linguistics
generative AI
links
ICL Still Loses to Fine-Tuning at Named Entity Recognition
AI
LLMs
NLP
knowledge graphs
fine-tuning
paper
links
Douglas Lenat, Who Tried to Make Computers More Human, Dies at 72
AI
commonsense
knowledge graphs
history
links
When Computers Write Proofs, What’s the Point of Mathematicians?
AI
mathematics
reasoning
formal verification
Lean
links
video
Give Us the Facts: Large Language Models vs. Knowledge Graphs
AI
LLMs
NLP
knowledge graphs
evaluation
paper
generative AI
LLM-Generated Code Has a Serious API Misuse Problem
AI
LLMs
code generation
software engineering
paper
generative AI
NLP
Model Editing: Performing Digital Brain Surgery
AI
LLMs
NLP
knowledge editing
paper
generative AI
LLMs Stumble Hard on Counterfactual Reasoning
AI
LLMs
reasoning
NLP
paper
generative AI
Faithful Text Generation from Knowledge Graphs with Noisy References
AI
knowledge graphs
NLP
paper
generative AI
Thousands of Scientists Are Leaving Twitter for Mastodon
science
social media
Mastodon
links
What Learning Algorithm Is In-Context Learning?
LLMs
ICLR 2023
in-context learning
deep learning
generative AI
AI
NLP
paper
How “Attention Is All You Need” Was Born: The Story Behind the Transformer Paper
NLP
research
history
deep learning
diversity
transformers
AI
The Dutch Government Joins Mastodon
Mastodon
European Commission
social media
links
Scientific Writing in the Age of Generative AI
writing
generative AI
ChatGPT
research
NLP
Do LLMs Really Understand? Recent Papers Reveal
AI
LLMs
reasoning
causal reasoning
code generation
NLP
paper
generative AI
How the Brain Processes German and Arabic Differently
NLP
multilingual
neuroscience
neurolinguistics
ICL Demonstration Selection and Disentangling Task Recognition from Task Learning
LLMs
GPT-3
NLP
paper
deep learning
domain adaptation
in-context learning
prompt engineering
Asimov, the Original Prompt Engineer
sci-fi
generative AI
NLP
robotics
prompt engineering
GPT’s Causal Reasoning Scores May Reflect Memorization, Not Reasoning
reasoning
GPT-4
ChatGPT
LLMs
NLP
paper
generative AI
AI
Rise of the Newsbots
generative AI
NLP
ethics
AI
links
LLMs, Copyrighted Training Data, and Fair Use
LLMs
copyright
GPT-3
BLOOM
AnthropicLM
Cohere
Codex
DMCA
generative AI
law
ethics
NLP
paper
Generative AI Is Powerful but Not Without Many Flaws
generative AI
LLMs
AI
ethics
NLP
Do Labels in ICL Demonstrations Actually Matter?
LLMs
GPT-3
paper
NLP
NLG
generative AI
in-context learning
Why Does In-Context Learning Work? Gradient Descent and PAC Learnability
generative AI
GPT-4
LLMs
deep learning
machine learning
paper
NLP
NLG
in-context learning
GPTs are GPTs: Labor Market Impact of Large Language Models
OpenAI
GPTs
LLMs
jobs
future of work
generative AI
NLP
NLG
ethics
paper
The Stupidity of AI (The Guardian)
The Guardian
generative AI
ethics
AI
computer vision
multimodal
NLP
Indirect Prompt Injection Threats
prompt injection
NLP
NLG
security
generative AI
The Waluigi Effect in LLMs
ChatGPT
Bing
Waluigi effect
LLMs
prompt engineering
NLP
NLG
ethics
TruthfulQA: Are Larger LLMs More Truthful?
LLMs
ACL 2022
GPT-3
ChatGPT
question answering
generative AI
NLP
NLG
deep learning
paper
ChatGPT Heralds an Intellectual Revolution
WSJ
ChatGPT
generative AI
AI
deepfakes
education
Homo Technicus
deep learning
ethics
future
NLP
NLG
The Profound Danger of Conversational AI
Bing
conversational AI
ChatGPT
deep learning
NLP
Bing is a Shapeshifter
Bing
ChatGPT
NLP
NLG
deep learning
LLMs
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT
multitask
multilingual
multimodal
evaluation
ChatGPT
SOTA
LLMs
reasoning
commonsense
NLG
NLP
NLU
deep learning
AI
paper
How Close Is ChatGPT to Human Experts? Human Evaluations and Detection
ChatGPT
AI
question answering
NLP
deep learning
paper
How Apple Is Organized for Innovation
Apple
Innovation
organizational structure
ChatGPT Banned on StackOverflow
ChatGPT
StackOverflow
education
recruiting
publishing
law
NLP
NLG
AI
ethics
AI safety
New Home on Mastodon
Mastodon
social media
personal
Fun with DALL-E 2 and Semantics Leakage
DALLE2
semantics
deep learning
multimodal
NLP
paper
My Love Letter to Macintosh
Apple
Macintosh
history
CMU
Linux
Mathematica
NLU & Reasoning: Distributional Semantics and the Road to Foundation Models
NLU
foundation models
stable diffusion
huggingface
NLP
knowledge graphs
deep learning
machine learning
AI
multimodal
paper
First Day at Apple
first day
knowledge graphs
AI
deep learning
machine learning
conversational AI
program synthesis
How to Solve AI’s Common Sense Problem
AI
NLP
NLU
commonsense
knowledge graphs
links
IBM Neuro-Symbolic AI Summer School Day 2: Question Answering (Pavan Kapanipathi)
NLP
NLU
knowledge graphs
conference
paper
IBM Neuro-Symbolic AI Summer School Day 1/2: AMR and MRS (Rademaker, Astudillo)
NLP
NLU
linguistics
knowledge graphs
conference
paper
One Day, A Computer Will Fit On A Desk (1974)
history
future
video
links
Liang on KG + MLM @ NAACL 2022
NLP
NLU
knowledge graphs
foundation models
conference
NAACL
Curriculum NLI @ NAACL 2022: Where Models Fail
NLP
NLU
knowledge graphs
NAACL
conference
paper
Muhao Chen on Robust IE @ NAACL 2022
NLP
knowledge graphs
conference
NAACL
The Curious Case of Control
NLP
linguistics
deep learning
foundation models
paper
links
Foundation Models: Opportunities and Risks
AI
NLP
foundation models
ethics
paper
links
DI-2021 @ KDD 2021: Heng Ji Talk Recording
NLP
knowledge graphs
conference
biomedical
DI-2021 @ KDD 2021: Heng Ji Announce
NLP
knowledge graphs
conference
biomedical
NAACL 2001 T-Shirt Design
NLP
NAACL
CMU
history
personal
The Unlikely Pioneer Behind mRNA Vaccines
biomedical
science
podcasting
links
First Dose of COVID Vaccine
biomedical
science
personal
Genius Maker: Hinton’s Auction
AI
deep learning
history
On the Dangers of Stochastic Parrots
NLP
NLG
NLU
ethics
paper
links
No matching items