• The Current ⚡️
  • Posts
  • DeepSeek Has Shaken The “Bigger Is Better” Presumption for AI Building To The Core

DeepSeek Has Shaken The “Bigger Is Better” Presumption for AI Building To The Core

Also, Qwen follows R1 release with advanced vision models

⚡️ Headlines

🤖 AI

Five Things Most People Don't Seem to Understand About DeepSeek – Gary Marcus lays it out for us. [Marcus on AI].

DeepSeek Hit with Large-Scale Cyberattack, Says It's Limiting Registrations – Chinese AI startup DeepSeek reports significant cyberattacks disrupting new user registrations, though existing users remain unaffected. [CNBC].

China's DeepSeek Sets Off AI Market Rout – The launch of China's DeepSeek AI assistant leads to a global tech stock selloff, with Nvidia experiencing a record market-cap loss. [Reuters].

DeepSeek vs. ChatGPT: Hands-On with DeepSeek's R1 Chatbot – A comparison between DeepSeek's R1 chatbot and OpenAI's ChatGPT reveals DeepSeek's innovative training methods and potential market disruption, despite common AI challenges. [Wired].

Startup Perplexity Offers Uncensored DeepSeek AI Search – Perplexity introduces an uncensored AI search engine named DeepSeek, aiming to transform user information access. [The Information].

Building Toward a Smarter, More Personalized Assistant – Meta announces advancements in its AI assistant, focusing on personalized experiences by remembering user preferences and providing tailored recommendations. [Meta].

🤳 Social Media

TikTok Launches 2025 Marketing Calendar – TikTok releases its 2025 marketing calendar to help brands optimize their seasonal campaigns and maximize engagement on the platform. [Social Media Today].

⚖ Legal

Trump Vows Near-Future Tariffs, Calls DeepSeek Progress 'Good' – President Trump announces plans for imminent tariffs on imported computer chips and pharmaceuticals, while acknowledging China's DeepSeek AI as a positive development. [Bloomberg].

🎱 Random

How SoftBank's Son Made It Back to the White House – SoftBank CEO Masayoshi Son plans a $40 billion investment in the Stargate data center project, marking a significant return to U.S. tech initiatives. [The Information].

The Americans Pledging to Buy Less—or Even Nothing – Amid rising prices and household debt, a growing number of Americans commit to minimal or no new purchases, focusing on financial responsibility and debt reduction. [The Wall Street Journal].

🔌 Plug-Into-This

DeepSeek, a Chinese AI startup, has released a groundbreaking AI model, R1, that performs comparably to OpenAI's cutting-edge technologies while using only a fraction of the computational and financial resources. This development challenges the long-standing industry belief that larger models, backed by massive scale, are the key to progress in AI.

  • Paradigm Shift: The success of DeepSeek's R1 model disrupts the notion promoted by industry leaders like Sam Altman of OpenAI that AI performance improves predictably with scale, such as by adding more computational power, data, and infrastructure.

  • Resource Efficiency: Unlike U.S. AI giants investing billions in GPUs and data centers, DeepSeek achieved its advancements on a relatively modest budget, showcasing the potential of smarter, more efficient methodologies over sheer size.

  • Market Disruption: Nvidia, a major supplier of GPUs driving the AI revolution, experienced a significant financial impact, with its market valuation dropping by $600 billion following the news of DeepSeek's efficiency-driven breakthroughs.

DeepSeek's viral moment has raised critical questions about future AI development strategies especially around potential cost reductions, and call into more serious question the broader environmental and economic impacts of resource-intensive AI scaling efforts that have been the norm so far in the US.

Qwen has unveiled Qwen2.5-VL, its latest vision-language model, marking a significant advancement from the previous Qwen2-VL. The model is available in three configurations—3B, 7B, and 72B—and can be accessed via Qwen Chat, Hugging Face, and ModelScope.

  • Enhanced Visual Understanding: Qwen2.5-VL excels in recognizing a wide array of objects, including flora, fauna, and various products. It also adeptly analyzes complex visual elements such as text, charts, icons, graphics, and layouts within images.

  • Agentic Capabilities: The model functions as a visual agent capable of reasoning and dynamically directing tools, facilitating interactions with devices like computers and smartphones.

  • Advanced Video Comprehension: Qwen2.5-VL can understand videos exceeding one hour in length and is equipped to pinpoint relevant segments, enhancing event detection within video content.

  • Precise Visual Localization: The model accurately identifies objects within images, generating bounding boxes or points, and provides stable JSON outputs detailing coordinates and attributes.

  • Structured Data Output: For documents such as invoices, forms, and tables, Qwen2.5-VL supports structured content extraction, benefiting applications in finance and commerce.

👁️ Qwen2.5-VL's release demonstrate rapid progression in vision-language models, highlighting the integration of comprehensive visual understanding with dynamic tool interaction.

DeepSeek has introduced Janus Pro, a new family of multimodal AI models designed to generate images from textual descriptions. The company asserts that Janus Pro outperforms existing models like OpenAI's DALL-E 3.

  • Model Variants: Janus Pro is available in two configurations: 1B and 7B parameters, catering to different performance and resource requirements.

  • Enhanced Image Generation: The models are trained to produce high-quality images based on textual prompts, aiming to improve upon the capabilities of current text-to-image generators.

  • Accessibility: DeepSeek has made Janus Pro models accessible to developers and researchers, promoting integration into various applications and further innovation in the field.

🔥 Following up their viral moment last week with this additional launch has DeepSeek cranking up the heat even more on American Big Tech firms. Is there anything is company can’t do? When is the bottom going to drop out on this hype — and on whom does it drop?

 🆕 Updates

📽️ Daily Demo

🗣️ Discourse