Anthropic's Claude 3 Opus has surpassed OpenAI's GPT-4 for the first time on Chatbot Arena. Chatbot Arena is a leaderboard run by the Large Model Systems Organization, a research organization dedicated to open models. Its site allows visitors to rate outputs from various models, enabling it to calculate the best models in aggregate. While Claude's rise is notable, GPT-4 is now over a year old.
Thursday, March 28, 2024: Anthropic has trained three new models in the Claude 3 family, the strongest of which matches GPT-4's reported benchmark results. It is also multimodal and performs well on vision tasks. Importantly, Claude's coding ability is substantially improved in this release.
Tuesday, March 5, 2024: Claude 3 successfully summarizes a video into a blog post, showcasing its long-context capabilities.
Andrej Karpathy issued a long context challenge to make a blog post from a recent video of his. Claude 3 was able to perform this task with some data pre-processing help. The resulting blog post is high quality and interesting.
Anthropic's new AI model, Claude 3, stands out for its 'warmth,' making it a robust partner in creative writing tasks. Claude 3 is described as more human-feeling and naturalistic, crossing the threshold from merely good thinking to deep thinking that is enjoyable to engage with. Although technical benchmarks don't fully capture this nuance, Claude 3 is poised to change how we interact with AI in creative processes.
Anthropic's Claude 3 Haiku is its fastest and most cost-effective AI model. It features advanced vision capabilities and excels in benchmarks. Claude 3 Haiku is designed for enterprises, with a focus on speed and affordability.
Anthropic releases a prompt library for Claude 3, offering effective user prompts for various tasks.
The release of Claude 3 has been quite popular, but the prompting style for these models is slightly different. Anthropic has collected a set of user prompts that work well for a wide variety of tasks and topics.
Anthropic identified a technique for jailbreaking long-context models. It has shared these findings with other organizations and implemented mitigations. This post outlines the technique and the defenses Anthropic put in place against it.
High-profile AI startups like Inflection AI, Stability AI, and Anthropic are facing financial pressures as they struggle with the high costs of developing generative AI models. While OpenAI, backed by Microsoft, has shown revenue growth, competitors like Anthropic and Stability AI grapple with substantial gaps between revenue and operating expenses. Microsoft's investment in AI hints at the tech industry's belief in AI's long-term profitability, despite the current challenges in monetizing these expensive technologies.
Anthropic has expanded its AI assistant, Claude, to Europe. Claude supports multiple languages. Anthropic is offering the service across its website, iOS app, and business plans for teams. The company is beginning the process of raising more money.
Mike Krieger, one of the co-founders of Instagram, is Anthropic's new chief product officer. Krieger spent the last few years working on an AI news-reading app that was recently acquired by Yahoo. His background in developing intuitive products and user experiences will be invaluable for Anthropic as it creates new ways for people to interact with its AI chatbot Claude.
Anthropic recently published a public research paper explaining why its AI chatbot chooses to generate content about certain subjects over others. Its researchers deciphered what parts of the chatbot's neural network mapped to specific concepts using a process known as 'dictionary learning'. The research showed how neurons associated with a topic fired together when the model was thinking about something associated with the topic - similar sets of neurons firing can evoke adjacent subjects. A link to the paper is available at the end of the article.
Anthropic's Responsible Scaling Policy aims to prevent catastrophic AI safety failures by identifying high-risk capabilities, testing models regularly, and implementing strict safety standards, with a focus on continuous improvement and collaboration with industry and government.
Anthropic researchers have unveiled a method to interpret the inner workings of its large language model, Claude Sonnet, by mapping out millions of features corresponding to a diverse array of concepts. This interpretability could lead to safer AI by allowing specific manipulations of these features to steer model behaviors. The study demonstrates a significant step in understanding and improving the safety mechanisms of AI language models.
Jan Leike, a former OpenAI researcher who resigned over AI safety concerns, has joined Anthropic to lead a new "superalignment" team focusing on AI safety and security. Leike's team will address scalable oversight, weak-to-strong generalization, and automated alignment research.
Anthropic is introducing a "tool use" feature for its Claude AI chatbot, enabling users to create personalized assistants that can interact with any external API. This feature can analyze data, provide product recommendations, track orders, offer technical support, and even process images for applications like interior design.
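A minimal sketch of what such a tool definition might look like, assuming the JSON-Schema style (name / description / input_schema) used by the Anthropic Messages API; the order-tracking tool and its fields are illustrative, not taken from the announcement.

```python
# Hypothetical sketch of a Claude "tool use" definition. The helper and
# the example order-tracking tool are illustrative assumptions.

def make_tool(name: str, description: str, properties: dict, required: list) -> dict:
    """Build a tool definition dict in the JSON Schema style Claude expects."""
    return {
        "name": name,
        "description": description,
        "input_schema": {
            "type": "object",
            "properties": properties,
            "required": required,
        },
    }

# An assistant could pass this in the `tools` list of a Messages API call;
# Claude then emits a tool_use block whose input matches the schema.
track_order = make_tool(
    name="track_order",
    description="Look up the shipping status of a customer order.",
    properties={"order_id": {"type": "string", "description": "The order number"}},
    required=["order_id"],
)
```

When Claude decides to call the tool, the application executes the real API request itself and returns the result to the model in a follow-up message.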
Claude is more than just a middle-of-the-road, sycophantic AI that agrees with the user. Claude's personality and character have been specifically designed using a character variant of Constitutional AI. This post goes in depth on how post-training steers Claude's output toward this desired character.
Anthropic has launched Claude 3.5 Sonnet, boasting better performance than GPT-4o and Gemini on several benchmarks alongside increased speed and expanded capabilities. The update also introduces the Artifacts feature, enhancing user interaction with the AI's outputs. Claude aims to transition from a chatbot to a central tool in business environments, integrating knowledge and workflow management.
Anthropic's new features in Claude allow developers to automate prompt engineering, improving AI application development by generating, testing, and refining prompts with quick feedback.
Several major AI companies, including Anthropic, Nvidia, Apple, and Salesforce, used subtitles from 173,536 YouTube videos across 48,000 channels to train AI models, despite YouTube's rules against unauthorized data harvesting. This has sparked backlash from content creators, who argue that their work has been exploited without consent or compensation, raising concerns about AI's impact on the creative industry and the ethics of using such training data.
Anthropic, in partnership with Menlo Ventures, is launching a $100 million Anthology Fund to support AI startups. The fund will provide at least $100k per startup, access to Anthropic's AI models, and various other perks such as networking opportunities and workspace access.
Freelancer.com and iFixit CEOs criticized Anthropic for excessive and unauthorized web scraping by its ClaudeBot, which disrupted their sites. Anthropic claims to respect the robots.txt file and says it will investigate the issue.
Reddit's CEO is calling on Microsoft and other companies to pay if they want to continue scraping the site's data. The site is now blocking companies that haven't signed agreements about how its data will be used or not used. Companies like Microsoft, Anthropic, and Perplexity have refused to negotiate. OpenAI's SearchGPT will be able to show Reddit results as the two companies reached a deal earlier this year.
Anthropic has introduced prompt caching for its Claude models, allowing developers to cache frequently used context, significantly reducing costs and latency, with early users like Notion already benefiting from faster and more efficient AI-powered features.
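A hedged sketch of how a cached request might be structured: the large, reused system context is marked with a `cache_control` block of type `ephemeral`, as described in Anthropic's prompt-caching announcement. The document text and model name are placeholders, and no request is actually sent here.

```python
# Sketch only: builds a Messages API request body with a cacheable prefix.
# LONG_REFERENCE_DOC stands in for a large context reused across many calls.

LONG_REFERENCE_DOC = "...full product manual goes here..."

def build_cached_request(question: str) -> dict:
    return {
        "model": "claude-3-5-sonnet-20240620",
        "max_tokens": 1024,
        "system": [
            {
                "type": "text",
                "text": LONG_REFERENCE_DOC,
                # Marks this prefix as cacheable so repeat calls skip
                # reprocessing it, cutting cost and latency.
                "cache_control": {"type": "ephemeral"},
            }
        ],
        "messages": [{"role": "user", "content": question}],
    }

body = build_cached_request("Where is the reset button?")
```

Because only the trailing user question changes between calls, the expensive prefix is processed once and then served from cache.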
Anthropic has added system prompts and updated dates for all models.
Anthropic has made Artifacts generally available, including on mobile.
Anthropic has published the system prompts used to guide its Claude AI models and plans to continue being transparent moving forward.
Anthropic has released a useful set of starter projects. It has partnered with former heads of AI from Brex, Uber, Facebook, and others to help write the first Quickstart, a scalable customer service agent powered by Claude.
This article contains an interview with Mike Krieger, the new chief product officer at Anthropic, in which he discusses why he decided to work in AI, what products he expects AI to enable in the future, and how he is thinking about building them. Krieger co-founded Instagram. He left Meta in 2018 and launched an AI-powered news reader, which was shut down earlier this year. Anthropic was started in 2021 by former OpenAI executives and researchers. Its main product is Claude, an industry-leading AI model and chatbot that competes with ChatGPT.
Anthropic shows how to semantically chunk documents, which dramatically improves retrieval performance while costing only about $1 per million chunks thanks to prompt caching.
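The core idea can be sketched as follows: split a document into chunks, then build one prompt per chunk that pairs the full document (which prompt caching makes cheap to resend) with the chunk to be situated in context. The paragraph-based chunking and the prompt wording below are assumptions, not Anthropic's exact recipe.

```python
# Minimal sketch of contextual chunking. The full document is repeated in
# every prompt, but caching means it is only processed once.

def chunk_by_paragraph(doc: str) -> list:
    """Naive chunker: one chunk per non-empty paragraph."""
    return [p.strip() for p in doc.split("\n\n") if p.strip()]

def contextualize_prompts(doc: str) -> list:
    """Build one context-generation prompt per chunk of the document."""
    prompts = []
    for chunk in chunk_by_paragraph(doc):
        prompts.append(
            f"<document>\n{doc}\n</document>\n\n"  # identical prefix, cacheable
            f"Here is a chunk of the document:\n<chunk>\n{chunk}\n</chunk>\n"
            "Write a short context situating this chunk within the document."
        )
    return prompts

doc = "Intro paragraph.\n\nDetails paragraph."
prompts = contextualize_prompts(doc)  # two prompts sharing one cached prefix
```

Each generated context string would then be prepended to its chunk before embedding, improving retrieval without reprocessing the whole document per chunk.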
Simon Willison explores the mechanics of HTTP streaming APIs from various large language model (LLM) providers, detailing how they function and providing practical examples of their usage. The primary focus is on the commonalities among these APIs, which use the `text/event-stream` content type to facilitate real-time data streaming. Each API sends data in blocks separated by double newlines, with each block containing a JSON line prefixed by `data:`. Notably, the EventSource API in browsers cannot be used directly with these APIs, since they typically employ POST requests instead of GET.

Willison provides a specific example using OpenAI's API, demonstrating how to send a prompt to the GPT-4o Mini model and request a streaming response. The command uses `curl` with the `--no-buffer` option to ensure that the output is displayed in real time as it is received. The response includes a series of data chunks, each representing part of the model's output, along with metadata about token usage.

He then examines the Anthropic Claude API, which also supports streaming responses. The example shows how to send a similar prompt and receive a structured response that includes event types such as `message_start`, `content_block_start`, and `content_block_delta`, indicating the progression of the response.

The Google Gemini API is discussed next, highlighting its tendency to return larger token chunks. Willison demonstrates this by prompting for a longer joke, which results in multiple parts being streamed back in succession.

In addition to the API examples, Willison shares code snippets for accessing these streaming responses using Python's HTTPX library and JavaScript's Fetch API. The Python example employs asynchronous programming to handle the streaming data, while the JavaScript example uses asynchronous iterators to process incoming events.
Overall, the exploration provides a comprehensive look at how different LLM APIs implement streaming, the structure of their responses, and practical coding examples for developers looking to integrate these capabilities into their applications.
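The shared wire format described above can be illustrated with a small parser: events separated by blank lines, each carrying a `data:` line whose payload is JSON. The sample stream below imitates OpenAI-style chunks (including the `[DONE]` end-of-stream sentinel); the exact field names vary by provider.

```python
# Sketch of parsing a text/event-stream (SSE) response offline.
# The sample stream imitates OpenAI-style delta chunks.

import json

def parse_sse(raw: str):
    """Yield the parsed JSON payload of each `data:` line in an SSE stream."""
    for block in raw.split("\n\n"):          # events are blank-line separated
        for line in block.splitlines():
            if line.startswith("data: "):
                payload = line[len("data: "):]
                if payload == "[DONE]":      # OpenAI's end-of-stream marker
                    return
                yield json.loads(payload)

stream = (
    'data: {"choices": [{"delta": {"content": "Hello"}}]}\n\n'
    'data: {"choices": [{"delta": {"content": " world"}}]}\n\n'
    "data: [DONE]\n\n"
)
text = "".join(c["choices"][0]["delta"]["content"] for c in parse_sse(stream))
# text == "Hello world"
```

A real client would read these blocks incrementally off the HTTP response body (e.g. with HTTPX's async iterators, as in Willison's examples) rather than from a pre-built string.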