Ep 18: 8 Levels of AI Engineering, Meta AI Delays, and LLM Neuroanatomy
This week, Dan, Shimin & Rahul covers Meta's struggles with its delayed "Avocado" AI model and potential Gemini licensing, NVIDIA's enterprise-ready NemoClaw fork of OpenClaw, SWE-bench analysis showing PRs wouldn't pass human review, prompting superstitions and developer identity, the 8 levels of agentic engineering, mainstream media framing of AI coding, legal liability for agent-written code, and a deep dive into LLM neuroanatomy where a researcher topped leaderboards by repeating model layers without changing weights.
Takeaways:
Takeaways:
- Meta may end up licensing Gemini despite massive AI investment — mirroring Apple's path
- SWE-bench failures were mostly code quality, not functionality — suggesting good enough may be good enough with proper agents.md
- A coworker analyzed 4.5 years of PRs to create a personalized coding style document for AI priming
- The fastest software paradigm adoption cycle ever may be the claw/agent paradigm
- Legal frameworks and insurance haven't caught up to agent-written code shipping to production
- Repeating later model layers (the "thinking" layers) can boost performance without fine-tuning — raising questions about whether chain-of-thought reasoning is essentially exercising these layers repeatedly
- Developers compared to ancient Egyptian scribes — language literacy as leverage
Resources Mentioned
Meta Delays Rollout of New A.I. Model After Performance Concerns
NVIDIA NemoClaw
Research note: Many SWE-bench-Passing PRs Would Not Be Merged into Main
The Collective Superstitions of People Who Talk to Machines
The 8 Levels of Agentic Engineering
Coding After Coders: The End of Computer Programming as We Know It
Built by Agents, Tested by Agents, Trusted by Whom?
LLM Neuroanatomy: How I Topped the LLM Leaderboard Without Changing a Single Weight
Chapters
Connect with ADIPod
Meta Delays Rollout of New A.I. Model After Performance Concerns
NVIDIA NemoClaw
Research note: Many SWE-bench-Passing PRs Would Not Be Merged into Main
The Collective Superstitions of People Who Talk to Machines
The 8 Levels of Agentic Engineering
Coding After Coders: The End of Computer Programming as We Know It
Built by Agents, Tested by Agents, Trusted by Whom?
LLM Neuroanatomy: How I Topped the LLM Leaderboard Without Changing a Single Weight
Chapters
- (00:00) - Introduction to AI in Software Development
- (02:42) - Meta's AI Model Delays and Market Position
- (09:51) - NVIDIA's New AI Developments
- (13:58) - Benchmarking AI Models and Code Quality
- (19:00) - Techniques Corner: AI Prompting and Creativity
- (22:56) - The Evolution of Coding and Creativity
- (28:46) - Levels of Agentic Engineering
- (34:58) - Mainstream Perspectives on AI and Software Development
- (43:00) - Trusting AI-Generated Code
- (44:40) - Metrics for Success in Autonomous Teams
- (46:59) - Legal and Ethical Implications of Autonomous Code
- (50:21) - Innovations in Language Model Architectures
- (01:01:02) - User Experience Challenges in Tech Development
- (01:03:47) - Market Predictions and Financial Insights
Connect with ADIPod
- Email us at humans@adipod.ai you have any feedback, requests, or just want to say hello!
- Checkout our website www.adipod.ai