OpenAI Cries Foul on DeepSeek

Also, US Copyright Office publishes report on AI generated works

⚡️ Headlines

🤖 AI

DeepSeek's R1 Model Challenges U.S. AI Dominance – DeepSeek's R1 model, developed using older Nvidia chips, rivals U.S. AI models, raising concerns about American competitiveness. [The Verge]

DeepSeek App Removed from Apple and Google Stores in Italy – Italian authorities have taken down the DeepSeek app from major app stores amid data privacy concerns. [Reuters]

On DeepSeek and Export Controls – Dario Amodei discusses the implications of DeepSeek's advancements on U.S. export control policies, emphasizing the need for maintaining technological advantages. [Dario Amodei]

Chinese and Iranian Hackers Exploit U.S. AI Products – Hackers from China and Iran are leveraging U.S. AI technologies to enhance their cyberattack capabilities. [The Wall Street Journal]

Mobile AI Apps Market Surges to $2 Billion, Led by ChatGPT – The mobile AI applications market has reached $2 billion, with ChatGPT leading the growth. [Bloomberg]

OpenAI Accuses DeepSeek of Data Misappropriation – OpenAI alleges that DeepSeek may have used its data without authorization, highlighting concerns over data usage in AI development. [404 Media]

DeepSeek's Censorship of Tiananmen Square Raises Concerns – DeepSeek's AI model has been found to censor discussions about Tiananmen Square, prompting debates over content moderation in AI. [The Wall Street Journal]

Meta Considers Integrating Chinese AI Models into Advertising Tools – Meta is exploring the use of Chinese AI models in its generative AI tools for advertisers, signaling a potential shift in its AI strategy. [The Information]

🦾 Emerging Tech

Trump Media Launches Fintech Services Brand – Trump Media and Technology Group has announced the launch of Truth.Fi, a fintech platform focusing on cryptocurrency and customized ETFs. [Reuters]

🤳 Social Media

The Rise of the 'Influencer Voice' on TikTok – A new speaking style, dubbed the 'influencer voice,' characterized by uptalk and vocal fry, is becoming prevalent among TikTok creators. [Fast Company]

⚖ Legal

Meta Agrees to Pay $25 Million to Settle Trump Lawsuit – Meta has agreed to a $25 million settlement with former President Trump over allegations of impermissible censorship following the suspension of his accounts post-January 6 riot. [The Wall Street Journal]

Trump Administration Considers Tightening Curbs on Nvidia's China Sales – Officials are discussing additional restrictions on Nvidia's sales of artificial intelligence chips to China, aiming to limit technological advancements by potential adversaries. [Bloomberg]

U.S. Copyright Office Clarifies Stance on AI in Filmmaking – The U.S. Copyright Office has stated that using AI tools in the filmmaking process does not undermine copyright protection, provided human creativity remains central. [Variety]

🎱 Random

MGM Agrees to $45 Million Settlement for Data Breach Victims – MGM Resorts International has agreed to pay $45 million to settle multiple class-action lawsuits related to data breaches in 2019 and a ransomware attack in 2023. [The Record]

🔌 Plug-Into-This

Microsoft and OpenAI are investigating whether a group associated with Chinese AI startup DeepSeek improperly accessed OpenAI’s data. The inquiry focuses on whether DeepSeek-affiliated individuals obtained proprietary outputs, raising concerns about security breaches and competitive intelligence gathering in the AI sector.

  • DeepSeek’s Strategic Positioning: The startup has rapidly positioned itself as a serious AI contender, with its latest model, DeepSeek V3, outperforming competitors and topping the U.S. iOS App Store’s AI assistant rankings. This rapid ascent has intensified scrutiny over its data sources and development practices.

  • Resource Efficiency as a Competitive Edge: DeepSeek claims to have trained its model on a fraction of the hardware and budget that U.S. tech giants require, raising questions about how it achieved such efficiency and whether external influences played a role.

  • Market Disruptions and AI Rivalries: The rise of DeepSeek has rattled major players, contributing to stock market reactions that saw declines in Nvidia, Microsoft, Alphabet, and semiconductor firms deeply invested in AI. The broader industry implications are fueling tensions over proprietary AI advancements.

  • Allegations of Data Misappropriation: Reports suggest that DeepSeek may have employed a technique known as "distillation," where one AI model learns from another by querying it extensively, effectively extracting and replicating its knowledge base. This method, if confirmed, could indicate that DeepSeek leveraged OpenAI's outputs to enhance its own models without authorization.

😏 Sweet, sweet irony? This is exemplary of the underlying paradox within the AI industry — OpenAI, which has been criticized for training its models on vast amounts of internet data without explicit permissions, now finds itself potentially victimized by similar practices. This cycle prompts a reevaluation of data ownership, ethical training practices, and the need for clearer guidelines in the AI community.

The U.S. Copyright Office has published a report addressing the copyrightability of works produced using generative artificial intelligence (AI). The report examines how existing copyright principles apply to AI-generated content and provides guidance on authorship and protection.

  • Human Authorship Requirement: The report reaffirms that copyright protection is granted only to works created by human authors. AI-generated content without meaningful human involvement does not qualify for copyright protection.

  • AI as an Assistive Tool: When AI is used under human guidance, the resulting work may be eligible for copyright protection, but only if there is sufficient human creativity in the final output. Mere reliance on AI-generated suggestions is unlikely to meet this threshold.

  • Role of Prompts: The specificity and originality of user prompts given to AI systems play a critical role. Detailed prompts reflecting a user's creative intent can contribute to human authorship, but simply entering generic commands does not establish ownership.

  • Modifications and Arrangements: Human alterations or arrangements of AI-generated material can qualify for copyright protection if they demonstrate original authorship. This suggests that AI-assisted works might still be protected if substantial human creativity is evident in post-processing or curation.

  • Legal and Policy Considerations: The report urges lawmakers, courts, and industry stakeholders to continue refining legal frameworks to balance innovation with intellectual property protection.

🤖 What even is “original authorship” in the AI era? Some things, like detailed prompting, have potential for ownership and protection. But as we know, prompting evolves quickly as models shift in the way they receive prompts based on the way average people use them. The real challenge still lies in defining the line between human-driven creativity and machine-generated output.

Researchers at UC Berkeley have developed TinyZero, a language model achieving GPT-3 level performance with only 0.1% of GPT-3's model size. This breakthrough demonstrates that large-scale language model capabilities can be replicated with significantly smaller models, potentially reducing computational costs and resource requirements even further.

  • TinyZero employs advanced model compression techniques, including knowledge distillation and parameter pruning, to achieve its compact size without sacrificing performance.

  • The model maintains high accuracy across various natural language processing tasks, such as text generation, translation, and comprehension.

  • Training TinyZero required innovative optimization algorithms to ensure efficiency and effectiveness in learning from large datasets.

  • The development emphasizes the importance of model efficiency, addressing concerns about the environmental impact and accessibility of large-scale AI models.

🔍 This advancement opens avenues for deploying sophisticated language models on devices with limited computational resources, broadening the applicability of AI technologies.

 🆕 Updates

📽️ Daily Demo

🗣️ Discourse