Ep 18: 8 Levels of AI Engineering, Meta AI Delays, and LLM Neuroanatomy

This week, Dan, Shimin & Rahul covers Meta's struggles with its delayed "Avocado" AI model and potential Gemini licensing, NVIDIA's enterprise-ready NemoClaw fork of OpenClaw, SWE-bench analysis showing PRs wouldn't pass human review, prompting superstitions and developer identity, the 8 levels of agentic engineering, mainstream media framing of AI coding, legal liability for agent-written code, and a deep dive into LLM neuroanatomy where a researcher topped leaderboards by repeating model layers without changing weights.

Takeaways:
  • Meta may end up licensing Gemini despite massive AI investment — mirroring Apple's path
  • SWE-bench failures were mostly code quality, not functionality — suggesting good enough may be good enough with proper agents.md
  • A coworker analyzed 4.5 years of PRs to create a personalized coding style document for AI priming
  • The fastest software paradigm adoption cycle ever may be the claw/agent paradigm
  • Legal frameworks and insurance haven't caught up to agent-written code shipping to production
  • Repeating later model layers (the "thinking" layers) can boost performance without fine-tuning — raising questions about whether chain-of-thought reasoning is essentially exercising these layers repeatedly
  • Developers compared to ancient Egyptian scribes — language literacy as leverage
Resources Mentioned
Meta Delays Rollout of New A.I. Model After Performance Concerns
NVIDIA NemoClaw
Research note: Many SWE-bench-Passing PRs Would Not Be Merged into Main
The Collective Superstitions of People Who Talk to Machines
The 8 Levels of Agentic Engineering
Coding After Coders: The End of Computer Programming as We Know It
Built by Agents, Tested by Agents, Trusted by Whom?
LLM Neuroanatomy: How I Topped the LLM Leaderboard Without Changing a Single Weight

Chapters
  • (00:00) - Introduction to AI in Software Development
  • (02:42) - Meta's AI Model Delays and Market Position
  • (09:51) - NVIDIA's New AI Developments
  • (13:58) - Benchmarking AI Models and Code Quality
  • (19:00) - Techniques Corner: AI Prompting and Creativity
  • (22:56) - The Evolution of Coding and Creativity
  • (28:46) - Levels of Agentic Engineering
  • (34:58) - Mainstream Perspectives on AI and Software Development
  • (43:00) - Trusting AI-Generated Code
  • (44:40) - Metrics for Success in Autonomous Teams
  • (46:59) - Legal and Ethical Implications of Autonomous Code
  • (50:21) - Innovations in Language Model Architectures
  • (01:01:02) - User Experience Challenges in Tech Development
  • (01:03:47) - Market Predictions and Financial Insights

Connect with ADIPod
  • Email us at humans@adipod.ai you have any feedback, requests, or just want to say hello! 
  • Checkout our website www.adipod.ai


Ep 18: 8 Levels of AI Engineering, Meta AI Delays, and LLM Neuroanatomy
Broadcast by