DrejcPesjak/README.md

Today's AI News

Today's Image

Summary: AI Reddit Recap (March 5, 2025)

Major Themes:

  • Model advancements: New and updated models like Qwen/QwQ-32B, Chroma, and GPT-4.5 are generating buzz.
  • Performance optimization: TeaCache speeds up WAN 2.1 by 100%, while LTX-Video gains keyframe and resolution support.
  • Accessibility and limitations: OpenAI's GPT-4.5 rollout to Plus users faces debate regarding rate limits and clarity.

Key Highlights:

  • QwQ-32B promises to outperform previous models and potentially rival much larger ones, such as models with 671B parameters.
  • Chroma, an open-source model, is trained on uncensored data and focuses on overcoming censorship challenges.
  • GPT-4.5 is now available to Plus users with enhanced memory capabilities, but is subject to message-count limits.
  • TeaCache gives WAN 2.1 a 100% speed increase, roughly doubling generation speed.
  • LTX-Video adds keyframe interpolation and video extension features, enhancing its capabilities.

Other notable discussions:

  • The versatility of llama.cpp for local LLM configuration and management.
  • The potential impact of TeaCache on the future of model development.
  • The humor and satire surrounding GPT-4.5's energy consumption claims.
  • GPT-4.5's current limitations and the potential for forthcoming updates.

Pinned

  1. DPhate-double-paraphrasing-hate-speech (Public)

    Bachelor's thesis on removing hate from online comments using paraphrasing: algorithm DPhate

    Python

  2. scaling-monosemanticity-llama (Public)

    Reproducing Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet using LLaMA. This project explores monosemantic neurons in large language models, implementing and extend…

    Jupyter Notebook 4

  3. Herz-bot (Public)

    A Q-learning model for the card game Herz.

    Java

  4. unbalanced-media (Public)

    Analysis of Unbalanced Slovenian Media News Outlets - Left vs. Right Wing

    Python

  5. weather-prediction-mlops (Public)

    An ML-in-the-cloud project for the university course Cloud Computing (RSO)

    Jupyter Notebook

  6. nyc-violation-tickets-analysis (Public)

    Analysis and prediction of NYC violation tickets using big data and machine learning techniques.

    Jupyter Notebook