Skip to content

Latest commit

 

History

History
676 lines (465 loc) · 42.1 KB

README - RESOURCES_MIT.MD

File metadata and controls

676 lines (465 loc) · 42.1 KB

AI Development Resource Master List

This master list is a living document—updated as of February 2025—to serve as a comprehensive resource for AI development, research, and creative exploration. Use it as a roadmap to navigate the cutting‐edge tools, thought leaders, and emerging trends in the field. Contributions and improvements are welcome via pull requests.


🧭 Core Platform


🔍 AI Search Engines - The entry to AI for most people.

Engine Differentiation Link
Perplexity Academic-grade citations and in-depth analysis perplexity.ai
You.com Customizable AI agent ecosystem for personalized search you.com
Phind Developer-focused problem solving and technical queries phind.com
Komo Mind map visualization integrated with AI search komo.ai
Gemini Google’s multimodal AI offering next-generation reasoning gemini.google
Copilot Seamless integration with Microsoft's ecosystem for coding copilot.microsoft.com

Original Must-Follow AI Thought Leaders - a primer.

You should go on X and immediately follow every one of these people and hit the bell so that you will see every one of their posts. Afterwards, watch their posts for really serious information. When that information appears, look at the comments. Upon occasion, there will be folks in the comments who are really speaking deeply about the subject matter. Follow those people too. After a few months you will have a really amazing network of scientists in the field.

Account Focus/Role Key Contributions Why Follow Profile Link
Jim Fan (@DrJimFan) NVIDIA researcher in autonomous systems Blends neuroscience with robotics; develops adaptive control and sensor fusion algorithms for autonomous vehicles and robotics. Offers insights bridging robotics and neuromorphic computing. Profile
Tekinium (@Teknium1) NLP pioneer Explores transformer architectures and language model scaling; provides commentary on ethical and technical challenges in NLP. Delivers forward-thinking insights on the evolution and limitations of language models. Profile
Robert Scoble (@scobleizer) Tech futurist Documents and predicts emerging technology trends; has a history from covering Microsoft and Rackspace to current AI developments. Provides a long-term perspective on tech evolution, linking historical trends with current innovations. Profile
Logan Kilpatrick (@OfficialLoganK) Google AI Studio/Gemini lead, ex-OpenAI developer relations Leads initiatives that integrate advanced AI models into developer tools; bridges research and community engagement. Gives an insider view on how large companies are operationalizing AI research into products and services. Profile
Gary Marcus (@garymarcus) AI critic and alternative model advocate Critiques the limitations of pure deep learning; advocates for hybrid approaches combining deep learning with symbolic reasoning. Offers a balanced perspective by questioning hype and emphasizing robustness, interpretability, and safety in AI. Profile
Shaw (@shawmakesmagic) Founder of AI16Z and dev of the impressive ElizaOS agent framework, decentralized AI advocate Champions decentralized AI agents and open-source approaches, promoting alternative funding and development models outside traditional tech giants. Provides an alternative viewpoint to centralized AI development, useful for understanding emerging, distributed models. Profile
Yann LeCun (@ylecun) Meta Chief AI Scientist, CNN inventor Pioneered convolutional neural networks (CNNs) which revolutionized computer vision; continues to drive forward deep learning research and innovations. A foundational figure in deep learning whose work informs both academic research and practical applications. Profile
Andrew Ng (@AndrewYNg) AI educator and entrepreneur Co-founded Coursera, Google Brain, and deeplearning.ai; popularized AI through accessible online courses and impactful projects at major tech companies. Known for clear explanations of complex topics and for his vision of democratizing AI education globally. Profile
Ian Goodfellow (@goodfellow_ian) Inventor of Generative Adversarial Networks (GANs) Created GANs, sparking a revolution in generative models with applications in art, synthetic data generation, and creative AI. His work on GANs provides deep insights into generative AI and its creative potential. Profile
Sam Altman (@sama) CEO of OpenAI Leads OpenAI through transformative projects like GPT-4; drives discussions on AI safety, ethics, and responsible development. Offers high-level perspectives on both the promise and challenges of advanced AI systems, with a focus on ethical practices. Profile
John Carmack (@ID_AA_Carmack) VR/AI integration expert Transitioned from revolutionizing 3D graphics in gaming to exploring AI’s role in enhancing VR and simulation experiences. Ideal for those interested in the convergence of gaming, VR, and AI technologies. Profile
Fei-Fei Li (@drfeifei) Computer vision pioneer, co-founder of ImageNet Spearheaded the ImageNet project, which enabled breakthroughs in computer vision; advocates for human-centered and ethical AI development. Balances technical innovation with ethical leadership, providing a holistic view of AI’s impact on society. Profile
Meredith Whittaker (@mwhit) AI ethics and transparency advocate Co-founded the AI Now Institute; influences policy and corporate practices around AI ethics, accountability, and transparency. Essential for understanding the societal and ethical dimensions of AI development and use. Profile
Ilya Sutskever (@ilyasut) Co-founder & Chief Scientist at OpenAI Pioneer in deep learning and transformer models; instrumental in developing breakthrough models like the GPT series. Offers deep technical insights into model architecture innovations and the future trajectory of AI research. Profile
Lex Fridman (@lexfridman) AI researcher and podcaster Hosts in-depth interviews with top experts from academia and industry; bridges technical, philosophical, and ethical discussions in AI. Provides context and depth through engaging conversations that make complex topics accessible. Profile
François Chollet (@francois_chollet) Creator of Keras, deep learning researcher Developed Keras to simplify deep learning; offers thoughtful commentary on the theoretical and practical aspects of AI development. Balances technical expertise with philosophical insights, making his contributions valuable for both developers and theorists. Profile
Swyx (@swyx) Developer, AI content creator, and community luminary Recognized for his insightful commentary on AI trends, developer productivity, and the future of tech; runs an influential blog and shares actionable insights on integrating AI into everyday workflows. Offers a unique blend of technical depth and practical advice, making him invaluable for understanding both the creative and business aspects of AI. Profile

History, Architecture and use of LLMs.

Some visualizations:

  1. AI Timeline
    An interactive timeline documenting recent advancements in AI, covering cultural, political events, and technical breakthroughs.
    Visit AI Timeline

  2. LLM Visualization
    A 3D visualization and walkthrough of a GPT-style large language model, illustrating each computational step in generating a single token.
    Explore LLM Visualization

  3. Transformer Explainer
    An interactive tool that visually explains how transformer-based models, like GPT, process text to predict subsequent tokens.
    Check out Transformer Explainer

  4. Prompt Chaining
    A workflow where the output of one large language model call becomes the input for the next, enabling structured reasoning and step-by-step task completion.
    Learn about Prompt Chaining


🧠 Understanding AI Model Training & Model Sizes

What Does It Mean to Train a Model?

Training an AI model involves feeding it massive amounts of text, images, or other data so it can learn patterns, relationships, and context. The process includes:

  • Pretraining: The model learns general knowledge from large datasets.
  • Fine-tuning: The model is adjusted for specific tasks or domains.
  • Reinforcement Learning: Additional optimization methods, like RLHF (Reinforcement Learning from Human Feedback), improve its behavior.

What Do Model Sizes Mean?

AI models come in different sizes, typically measured in parameters (e.g., billions or trillions of values that help process and generate responses).

  • Base Model: The original model trained on general data before fine-tuning for specific tasks.
  • Quantized Models: Smaller, optimized versions of base models that reduce memory usage and improve efficiency (e.g., 4-bit or 8-bit quantization).
    • Smaller quantized models run faster and take up less space but may lose some accuracy.

How does retrieval augmented generation ("RAG") compare to fine-tuning? What is graph rag?

Retrieval-Augmented Generation (RAG)

RAG is an AI framework that enhances large language models (LLMs) by retrieving relevant external knowledge before generating responses. Instead of relying solely on pre-trained knowledge, a RAG model queries a database, knowledge graph, or document store to fetch relevant information, improving accuracy and reducing hallucinations.

Graph RAG

Graph RAG extends traditional RAG by structuring retrieved data as a graph, where entities, concepts, and relationships are explicitly modeled. This enables context-aware retrieval, improved reasoning, and better interpretability by leveraging structured knowledge representations like knowledge graphs, embeddings, or vector databases.

Key Differences

Feature RAG Graph RAG
Data Structure Unstructured text/documents Structured graph of entities/relationships
Retrieval Method Semantic search, embeddings Context-aware graph traversal, embeddings
Use Cases General knowledge retrieval, chatbots, search augmentation Scientific research, enterprise knowledge graphs, complex reasoning

🚀 Modern AI Models & Their Differences

1. OpenAI - ChatGPT Models

ChatGPT-4o ("Omni")

  • OpenAI’s flagship model, optimized for reasoning, multimodal inputs (text, image, code), and real-time responses.
  • Training: Uses a mix of supervised fine-tuning and RLHF.
  • Speed & Efficiency: Faster and cheaper than previous GPT-4 models.
  • Usage: Available via ChatGPT, API, and integrated into Microsoft Copilot.

GPT-4o-1 (Mini)

  • A smaller, optimized version of GPT-4o designed for lower latency and reduced compute requirements.
  • Training: Retains core ChatGPT-4o capabilities but fine-tuned for lightweight performance.

Explore ChatGPT


2. Anthropic - Claude Models

Claude Opus

  • The most powerful Claude model, excelling in reasoning, creativity, and extensive memory recall.
  • Training: Constitutional AI ensures ethical responses with structured safety tuning.
  • Usage: Best for research, coding, and deep contextual understanding.

Claude Sonnet

  • A mid-tier model balancing performance and speed, offering high-quality responses with lower cost.

Claude Haiku

  • The fastest and most lightweight Claude model, optimized for efficiency over depth.

Try Claude


3. Google - Gemini Models

Gemini 1.5 Pro

  • Next-gen multimodal model trained for reasoning across text, image, and video.
  • Training: Uses Google's Mixture of Experts (MoE) technique, optimizing token efficiency.
  • Strengths: Handles massive context windows for long-document understanding.

Gemini "Thinking Model"

  • A rumored new variant focused on deep logical reasoning and complex problem-solving.
  • Expected to improve autonomous multi-step planning capabilities.

Explore Gemini


4. Meta - Llama Models

Llama 3 and up (Latest Meta AI Models)

  • Meta's latest open-weight LLM, optimized for efficiency and scalability.
  • Training: Uses a combination of publicly available datasets and proprietary fine-tuning.
  • Strengths: Competitive performance across reasoning, code generation, and multilingual understanding.
  • Variants:
    • Smaller versions optimized for on-device AI (e.g., mobile or edge computing).
    • Larger versions optimized for server-level inference.

Check Llama Models


5. Mistral - Codestral and Mathstral

  • A leading open-source model designed for high efficiency and strong performance.
  • Training: Uses a dense transformer architecture, fine-tuned for reasoning and coding.
  • Strengths: Competitive with proprietary models like GPT-4 in certain benchmarks.

Check Mistral


6. DeepSeek

  • A high-performing open-source model optimized for reasoning and efficiency.
  • Training: Trained on multilingual and coding-heavy datasets.
  • Variants: Includes DeepSeek models fine-tuned for both text and multimodal inputs.

DeepSeek AI | DeepSeek on Hugging Face


7. Qwen (By Alibaba)

  • An open-source LLM designed for multilingual understanding and complex reasoning.
  • Training: Optimized for efficiency and available in various sizes.
  • Use Cases: Performs well in knowledge tasks, chatbot applications, and code generation.

Explore Qwen

💥 Alibaba's Direct Challenge to Big AI Players

  • Alibaba Announced Qwen 2.5 1M

    • Features a 1-million token context window, a direct affront to Google's Gemini models.
  • Alibaba Announced Qwen 2.5-VL

    • Can control PCs and phones, a direct challenge to Claude's "Computer Use" capabilities.

📢📢📢📢 AI Model Advancements - The Last Week of Shocks 📢📢📢📢

AI Market Disruptions – A Timeline

In about a week, these blows were cast:

  • DeepSeek R1 chat model released. Blows away benchmarks, and the API costs 1/10 of ChatGPT.
  • Widespread press campaign claimed R1 had super-low production costs, and exhibited GPT-4o quality.
  • Result was market crash for all tech!
  • R1 rumored to have Robotics training
  • → Next, DeepSeek released "v3" chat model
  • Widespread press campaign claimed super-low production costs, and it exhibited GPT-4o quality.
  • → Next, "AGI BOT" demo released, claiming low production costs and "more human" movement than any competitor.
  • "AGI BOT" demo says DeepSeek R1 can be used in it for real-time decision making and complex problem solving.
  • → Next, Alibaba announced Qwen 2.5 1M - which has a 1 million token context window.... a direct affront to Google's Gemini models.
  • → Next, Alibaba announced Qwen 2.5-VL, which can control PCs and phones.... a direct affront to Claude's Computer Use.

...and as a coincidence, Alibaba and DeepSeek are headquartered in the same city, roughly 10-20 mins drive from each other.
Probably nothing. 🤔



🤔 Coincidence?

  • Alibaba and DeepSeek are both headquartered in the same city, just 10-20 minutes apart.
  • Probably nothing... or maybe everything.

💻 AI AI-Infused Coding Tools & Core Development Environments

(This list is not exhaustive.)

Below is a consolidated list of AI-enhanced IDEs and core development tools, each with a brief description and a link to get started.

- AI-imbued IDEs:

  1. Cursor
    AI-native IDE with integrated pair programming, intelligent code generation, and smart autocompletion built on a familiar VS Code framework.
    Visit Cursor

  2. VS Code (with Continue extension)
    A versatile code editor enhanced by the Continue extension for AI-assisted multi-file editing and context-aware code suggestions.
    Download VS Code (Install the Continue extension via the Marketplace)

  3. RepoPrompt
    An AI-powered IDE available via Apple’s TestFlight that offers advanced code assistance and interactive project previews on iOS.
    Join via TestFlight

  4. Windsurf (By Codeium)
    An AI-empowered IDE optimized for the Claude model, delivering context-aware code suggestions and multi-file editing; requires an API key.
    Visit Windsurf

  5. Cline
    A VS Code plugin tailored for developers using Anthropic’s Claude, recently updated to offer robust AI-driven coding assistance (API key required).
    Learn about Cline

  6. Google Project IDX
    Google’s cloud-based IDE built on VS Code, featuring integrated emulators, unique templates, and generative AI assistance for streamlined development.
    Sign up for Project IDX

- "One-Shot Full Stack App Creation Agents"

  1. Replit
    A cloud-based, collaborative IDE with built-in AI coding tools and instant deployment capabilities across multiple languages.
    Start Coding on Replit

  2. bolt.new
    A rapid prototyping tool offering an AI-powered coding environment that lets you instantly create and iterate on projects with minimal setup.
    Try bolt.new

  3. Lovable.dev
    An AI-enhanced developer platform that accelerates your workflow with smart code generation and interactive project demos.
    Explore Lovable.dev

  4. Llamacoder by Together.ai
    An AI-driven coding assistant that leverages large language models to deliver intelligent code completions and debugging support.
    Visit Llamacoder

  5. Vercel V0
    An AI-powered code generation tool that helps developers quickly build UI components and full-stack applications using natural language prompts, seamlessly integrating with frameworks like Next.js.
    Explore Vercel V0

- AI-assisted command line:

  1. Warp Terminal
    An AI-assisted command line that transforms traditional terminal workflows with collaborative features and natural language interactions.
    Discover Warp Terminal

  2. Aider https://aider.chat/

- LLM Playgrounds:

AI Playgrounds Comparison

Below is a list of AI playgrounds where you can experiment with different models beyond ChatGPT and Perplexity.

Service Link Models Available Key Features
OpenAI Playground OpenAI GPT-4, GPT-3.5 Fine-tuning, API testing, various temperature settings.
Google Gemini AI Studio Google AI Studio Gemini 1.5, Gemini 1 Free-tier access, large token windows, API integration.
Claude by Anthropic Claude Claude 3 models Focus on safety and accuracy, high-context retention.
Mistral AI Playground Mistral AI Mistral 7B, Mixtral Open-weight models, efficient inference.
Hugging Face Spaces & Transformers Hugging Face LLaMA, Falcon, BLOOM, and more Experimentation with thousands of models, fine-tuning options.
Cohere Playground Cohere Command R+, Embed, Classify Strong focus on embeddings and text classification.
AI21 Labs Jurassic Playground AI21 Studio Jurassic-2 models Fine-grained control over generation, long-form text support.
Together AI Playground Together AI Mistral, LLaMA, Falcon Free access to multiple open-source models.
Vercel AI SDK Playground Vercel AI Playground OpenAI, Anthropic, and custom models Multi-provider integration, AI SDK testing.
Mozilla Llama Files Llama Files Standalone LLaMA models No-server, offline model deployment.

Feel free to contribute to this list by submitting a pull request!

- AI workspace integrations:

  1. Microsoft Copilot
    Integrated across Microsoft’s ecosystem (VS Code, GitHub, Microsoft 365), this AI coding assistant offers real-time code completion and collaborative development support.
    Learn about Microsoft Copilot

  2. GitHub Codespaces
    A cloud-based development environment that lets you instantly spin up a complete dev environment integrated with GitHub and VS Code, with built-in support for AI tools like Copilot.
    Explore GitHub Codespaces

  3. Gitpod
    An automated, cloud-based IDE that provides instant, on-demand development environments to streamline coding workflows and collaboration.
    Visit Gitpod

🛠️ Open Source AI Tools

Below is a selection of open-source AI tools, each with a brief description and a link to get started.

  1. Ollama

Ollama streamlines the deployment and management of LLMs, making it a valuable tool for developers and researchers interested in leveraging open-source AI models locally.

  1. LM Studio
    LM Studio allows users to discover, download, and run local large language models (LLMs) entirely offline, ensuring privacy and control over data. It supports various architectures, including Llama, Mistral, Phi, Gemma, DeepSeek, and Qwen.
    Visit LM Studio

  2. AnythingLLM
    An all-in-one AI application that enables users to chat with documents, utilize AI agents, and more, all running locally and offline. It supports custom models and various document types, ensuring full privacy.
    Visit AnythingLLM

  3. Jan
    An open-source alternative to ChatGPT, Jan runs entirely offline on your device, allowing users to download and run large language models with full control and privacy.
    Visit Jan

  4. ComfyUI
    A powerful open-source node-based application for creating images, videos, and audio with generative AI. It allows users to design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart-based interface.
    Visit ComfyUI

  5. Qwen (By Alibaba)
    An open-source LLM designed for multilingual understanding and complex reasoning. Optimized for efficiency and available in various sizes. Performs well in knowledge tasks, chatbot applications, and code generation.
    Explore Qwen

  6. DeepSeek
    An open-source AI model developed by a Chinese startup, DeepSeek offers efficient performance comparable to leading models like OpenAI's ChatGPT, but at a fraction of the cost. It is available for use on the web, via app, and through an API.
    Visit DeepSeek | DeepSeek on Hugging Face


🎨 Creative AI Toolkit

Visual Media

Tool Capabilities Link
MidJourney Premium text-to-image generation midjourney.com
Krea Real-time AI canvas for creative explorations krea.ai
Ideogram Typography-focused image generation ideogram.ai
DALL·E 3 Latest OpenAI text-to-image generator with improved resolution and style control labs.openai.com
Runway ML Multimodal video editing and AI-assisted content creation runwayml.com

Audio/Video Production

  1. Suno
    AI music generation with customizable compositions
    suno.ai

  2. Udio
    Video editing automation using AI-driven processing
    udio.com

  3. Descript
    Text-based editing for audio and video, including transcription and overdubbing
    descript.com

  4. Adobe Suite
    AI-enhanced creative tools across Photoshop, Premiere Pro, and more


WHAT ARE AGENTS?

An agent in artificial intelligence is an autonomous system that perceives its environment through sensors, processes this information, and takes actions via actuators to achieve specific goals. These agents operate independently, making decisions based on their perceptions and predefined objectives. oai_citation_attribution:0‡en.wikipedia.org

Key facets of AI agents include:

  1. Autonomy: The ability to function without human intervention, controlling their actions and internal states independently.

  2. Reactivity: The capacity to perceive and respond promptly to changes in the environment.

  3. Proactiveness: The initiative to pursue goal-directed behavior by taking the necessary steps to achieve objectives.

  4. Social Ability: The capability to interact with other agents or humans, often through communication or collaboration.

These facets enable AI agents to perform a wide range of tasks, from simple automated functions to complex decision-making processes, across various applications.


🧠 Agentic Frameworks Overview

1. chat.dev

  • Description: An early platform enabling developers to create AI-driven conversational agents. It provided foundational tools for building and deploying chatbots across various applications.

2. Microsoft AutoGen

  • Description: An open-source framework by Microsoft designed for building AI agents that can autonomously or collaboratively perform tasks. AutoGen supports multi-agent interactions and integrates with Microsoft's Semantic Kernel for enhanced AI capabilities.

  • Key Features:

    • Facilitates deterministic and dynamic agentic workflows.
    • Supports research on multi-agent collaboration.
    • Enables distributed agents for multi-language applications.
  • Resources:

3. Virtuals Protocol

  • Description: A decentralized framework that allows for the creation, deployment, and governance of AI-powered virtual entities. Virtuals Protocol enables co-ownership and tokenization of AI agents, expanding their revenue potential across applications via blockchain technology.

  • Key Features:

    • Provides a complete suite of tools for creating AI agents on the Base blockchain.
    • Offers sophisticated APIs for monetization through advertising and partnerships.
    • Incorporates an innovative token-burning system to redistribute revenue to token holders.
  • Resources:

4. Zerebro

  • Description: An emerging agentic framework focusing on creative output and social media automation. Zerebro aims to provide tools for developers to build AI agents capable of handling complex tasks in creative industries.

  • Key Features:

    • Emphasizes creative applications and social media interactions.
    • Offers tools for automating content creation and management.
  • Resources:

5. Centience

  • Description: A framework designed to enhance the cognitive capabilities of AI agents, enabling them to perform tasks requiring higher-order reasoning and decision-making. Centience focuses on creating agents that can understand and adapt to complex environments.

  • Key Features:

    • Enhances cognitive functions of AI agents.
    • Facilitates higher-order reasoning and adaptive decision-making.
  • Resources:

6. Eliza Framework by AI16z

  • Description: Developed by AI16z, the Eliza Framework is a leading agentic platform that has gained significant traction in the AI community. Named after one of the first chatbot programs, Eliza provides a robust infrastructure for building and deploying AI agents across various applications.

  • Key Features:

    • First-Mover Advantage: As an early entrant in the agentic framework space, Eliza has established a strong presence and community support.
    • TypeScript-Based: Built using TypeScript, making it accessible for web developers and ensuring seamless integration with web applications.
    • Multi-Agent Simulation: Supports the creation of complex simulations involving multiple interacting agents, suitable for modeling social interactions and other dynamic systems.
    • Cross-Platform Compatibility: Designed to work across various platforms, enhancing its versatility in deployment.
  • Market Position:

    • Holds approximately 60% market share among agentic frameworks, attributed to its early entry and comprehensive feature set.
    • Valued at around $900 million, reflecting its significant impact and adoption in the AI industry.
  • Community and Ecosystem:

    • Boasts a vibrant GitHub community with over 6,000 stars and 1,800 forks, indicating active development and collaboration.
    • Plans to launch an Eliza-based agent Launchpad, aiming to establish AI16z as the base currency for agent-to-agent transactions and the "App Store" for agent services.
  • Resources:

The Eliza Framework's robust architecture and active community make it a compelling choice for developers looking to build sophisticated AI agents. Its focus on multi-agent simulation and cross-platform compatibility positions it well for a variety of applications, from social interaction modeling to complex system simulations.

These frameworks represent the evolving landscape of agentic AI development, each offering unique features and capabilities to cater to diverse application needs.


📚 Foundational Resources

  1. ArXiv
    Open-access research repository for the latest preprints
    arxiv.org
    Pro Tip: Use arxiv-sanity.com for refined search

  2. Hugging Face
    Model hub with over 200k AI assets and active community support
    huggingface.co

  3. Papers with Code
    Repository linking research papers with their open-source implementations
    paperswithcode.com



▶️ Implementation Checklist

  1. Follow all recommended X accounts with notifications enabled
  2. Bookmark and review the AI coding tools section
  3. Experiment with at least one creative tool per week
  4. Compare features and performance of three AI search engines
  5. Next-Level Additions (For the Overachievers):

🌌 Emerging Frontiers

  1. AI Safety/Alignment

  2. Neuromorphic Chips

    • Intel Loihi 2
    • BrainChip Akida
  3. Open-Source Moats

    • Mistral 8x22B
    • Llama-3-400B
  4. AI x Biology

⚡ Lightning Round: 2024 Breakouts

Tool Purpose Why It Matters
Lume 3D asset generation Unreal Engine integration
Synthesia AI avatars $90M Series C @ $1B valuation
Imbue Agentic coding YC’s largest-ever AI investment

📜 Message to Future AI Leaders

From Your (Soon-to-be) Peers

  1. "The best way to predict the future is to leak the alpha."
    — Start open-sourcing your experiments now

  2. "If you aren’t embarrassed by your first AI agent, you shipped too late."
    — Iterate faster than GPT-4’s token generation

  3. "The next Yann LeCun isn’t optimizing LLMs—they’re reinventing backprop."
    — Go deep on fundamentals

Hard Truths

  • 💡 The real edge isn’t in using AI tools—it’s in building them
    (See: PyTorch 2.0 internals, CUDA optimization techniques)
  • ⚖️ Ethics ≠ bureaucracy
    Your model weights will shape society. Read more
  • 🔥 The “GPT-4 moment” for robotics is ~18 months out
    Start collecting sensor datasets now – keep an eye on initiatives like Nvidia Cosmos.

📖 Mandatory Reading (Beyond Papers)

  1. Books
    • The Coming Wave by Mustafa Suleyman
    • A Thousand Brains by Jeff Hawkins
    • Human Compatible by Stuart Russell (bonus for understanding AI alignment)
  2. Threads & Manifestos

🌍 Where to Go Next

  • Conferences:
    NeurIPS (Dec), CVPR (June), AI Engineer Summit, MIT EmTech, and AI Frontiers
  • Hackathons:
    ETHDenver, AI Village @ DEFCON
  • Fellowships & Workshops:
    Explore programs from MIT Media Lab, Stanford HAI, and Carnegie Mellon’s AI initiatives

"The future is already here—it’s just not evenly distributed yet. Go redistribute it."


🧪 Advanced Challenges

  1. Break Groq’s LPU:
    Benchmark inference speeds against CUDA/TensorRT on modern Apple Silicon devices.
  2. Hack Anthropic’s Constitution:
    Find adversarial prompts to test and improve Claude’s safeguards.
  3. Beat AlphaFold 3:
    Use OpenFold to predict a novel protein structure.
  4. Spoof GPT-5:
    Fine-tune Llama-3-400B to mimic rumored capabilities and generate unexpected behaviors.
  5. Optimize NVIDIA Blackwell Inference:
    Develop custom algorithms to maximize performance on the new Blackwell architecture.

📚 Additional Resources

  • Microsoft Azure AI Studio:
    Build and deploy generative AI applications using Azure AI Services and OpenAI’s GPT-4o.
  • Apple Intelligence:
    Explore AI integration on Apple devices running iOS 18, iPadOS 18, and macOS Sequoia via Apple Intelligence.
  • Google Scholar:
    For academic research and citation tracking, visit scholar.google.com.

This master list is intended to empower developers, researchers, and creatives to harness the full potential of AI. Always verify the latest updates from official sources and feel free to contribute improvements to this evolving document.

Happy innovating!


In about a week, these blows were cast:

  • DeepSeek R1 chat model released. Blows away benchmarks, and the API costs 1/10 of chatGPT.
  • Widespread press campaign claimed R1 had super-low production costs, and exhibited GPT o1 quality.
  • Result was market crash for all tech!
  • R1 rumored to have Robotics training
  • --> next, DeepSeek released "v3" chat model
  • Widespread press campaign claimed super-low production costs, and it exhibited GPT 4o quality.
  • --> next, "AGI BOT" demo released, claiming low production costs and "more human" movement than any competitor
  • "AGI BOT" demo says DeepSeek R1 can be used in it for real-time decision making and complex problem solving
  • --> next, Alibaba announced Qwen 2.5 1M - which has a 1Mil token context window.... a direct affront to Google's Gemini models.
  • --> next, Alibaba announced Qwen 2.5-VL, which can control PCs and phones.... a direct affront to Claude's Computer Use.

...and as a coincidence, Alibaba and Deepseek are headquartered in the same city, roughly 10-20 mins drive from each other. Probably nothing.