Categoria: AI

Auto Added by WPeMatico

  • Elon Musk’s AI company, xAI, launches an API for Grok 3

    Billionaire Elon Musk has just been countersued by OpenAI, but that isn’t stopping his AI company, xAI, from making its flagship Grok 3 model available via an API. It has been several months since xAI unveiled Grok 3, the company’s answer to models like OpenAI’s GPT-4o and Google’s Gemini.  Grok 3 can analyze images and respond to questions, […]
  • Wayve’s self-driving tech is headed to Nissan vehicles

    Nissan said it will use self-driving software developed by Wayve to beef up its advanced driver assistance system starting in 2027, a landmark deal for the buzzy UK startup that has raised more than $1.3 billion from Nvidia, Microsoft, SoftBank Group, and Uber.  Nissan will integrate Wayve’s software as well as sensors, including lidar from an undisclosed […]
  • Deep Cogito emerges from stealth with hybrid AI ‘reasoning’ models

    A new company, Deep Cogito, has emerged from stealth with a family of openly available AI models that can be switched between “reasoning” and non-reasoning modes. Reasoning models like OpenAI’s o1 have shown great promise in domains like math and physics, thanks to their ability to effectively fact-check themselves by working through complex problems step […]
  • Meta got caught gaming AI benchmarks

    Over the weekend, Meta dropped two new Llama 4 models: a smaller model named Scout, and Maverick, a mid-size model that the company claims can beat GPT-4o and Gemini 2.0 Flash “across a broad range of widely reported benchmarks.”

    Maverick quickly secured the number-two spot on LMArena, the AI benchmark site where humans compare outputs from different systems and vote on the best one. In Meta’s press release, the company highlighted Maverick’s ELO score of 1417, which placed it above OpenAI’s 4o and just under Gemini 2.5 Pro. (A higher ELO score means the model wins more often in the arena when going head-to-head with competitors.)

    The achievement seemed to position Meta’s open-weight Llama 4 as a serious challenger to the state-of-the-art, closed models from OpenAI, Anthropic, and Google. Then, AI researchers digging through Meta’s documentation discovered something unusual.

    In fine print, Meta acknowledges that the version of Maverick tested on LMArena isn’t the same as what’s available to the public. According to Meta’s own materials, it deployed an “experimental chat version” of Maverick to LMArena that was specifically “optimized for conversationality,” Te …

    Read the full story at The Verge.

  • Gemini Live’s screensharing feature is rolling out to Pixel 9 and Galaxy S25 devices

    Just press the cooresponding button in Gemini Live to activate video or screenshare.

    Google’s Gemini Live camera and screenshare functions, which let the conversational AI chatbot answer questions about the stuff you’re looking at in real-time, are rolling out now on Pixel 9 series phones and Samsung Galaxy S25 devices. The free update is also coming soon to other Android devices, but you’ll need to be a paid Gemini Advanced user to gain access.

    Once available on your device, you can activate the live video function at the push of a button and ask Gemini Live questions about whatever your camera can see. As demonstrated in Google’s April Pixel Drop video, you can do things like point the camera at an aquarium tank and ask Gemini Live questions about specific fish. You can also tap the new screenshare button, show Gemini Live a shopping website, and ask the AI assistant to compare products or provide styling advice.

    Last month, Google spokesperson Alex Joseph confirmed to The Verge that the features started rolling out to customers, and some users on Reddit confirmed it appeared on their devices, including on a Xiaomi phone. The video and screensharing in Gemini Live were first demonstrated in May at Google’s I/O developer conference as part of “Project Astra.”

    Gemini Live is available in 45 languages in select countries for users 18 years of age and older (excludes education and enterprise accounts).

  • Google is allegedly paying some AI staff to do nothing for a year rather than join rivals

    Retaining top AI talent is tough amid cutthroat competition between Google, OpenAI, and other heavyweights. Google’s AI division, DeepMind, has resorted to using “aggressive” noncompete agreements for some AI staff in the U.K. that bar them from working for competitors for up to a year, Business Insider reports. Some are paid during this time, in […]
  • Amazon says its AI video model can now generate minutes-long clips

    Amazon has upgraded its AI video model, Nova Reel, with the ability to generate videos up to two minutes in length. Nova Reel, announced in December 2024, was Amazon’s first foray into the generative video space. It competes with models from OpenAI, Google, and others in what’s fast becoming a crowded market. The latest Nova […]
  • Meta’s benchmarks for its new AI models are a bit misleading

    One of the new flagship AI models Meta released on Saturday, Maverick, ranks second on LM Arena, a test that has human raters compare the outputs of models and choose which they prefer. But it seems the version of Maverick that Meta deployed to LM Arena differs from the version that’s widely available to developers. […]
  • Microsoft releases AI-generated Quake II demo, but admits ‘limitations’

    Microsoft has released a browser-based, playable level of the classic video game Quake II. This functions as a tech demo for the gaming capabilities of Microsoft’s Copilot AI platform — though by the company’s own admission, the experience isn’t quite the same as playing a well-made game. You can try it out for yourself, using […]
  • Meta releases two Llama 4 AI models

    Meta has announced Llama 4, its newest collection of AI models that now power the Meta AI assistant on the web and in WhatsApp, Messenger, and Instagram. The two new models, also available to download from Meta or Hugging Face, are Llama 4 Scout — a small model capable of “fitting in a single Nvidia H100 GPU” — and Llama 4 Maverick, which is more akin to GPT-4o and Gemini 2.0 Flash. Meta says it’s still in the process of training Llama 4 Behemoth, which Meta CEO Mark Zuckerberg says is “the highest performing base model in the world.”

    According to Meta, Llama 4 Scout has a 10-million-token context window — the working memory of an AI model — and beats Google’s Gemma 3 and Gemini 2.0 Flash-Lite models, as well as the open-source Mistral 3.1, “across a broad range of widely reported benchmarks,” while still “fitting in a single Nvidia H100 GPU.” Meta makes similar claims about its larger Maverick model’s performance versus OpenAI’s GPT-4o and Google’s Gemini 2.0 Flash, and says its results are comparable to DeepSeek-V3 in coding and reasoning tasks using “less than half the active parameters.”

    Visual comparison of model specs.

    Meanwhile, Llama 4 Behemoth has 288 billion active parameters with 2 trillion parameters in total. While it hasn’t been released yet, Meta says Behemoth can outperform its competitors (in this case GPT-4.5 and Claude Sonnet 3.7) “on several STEM benchmarks.”

    For Llama 4, Meta says it switched to a “mixture of experts” (MoE) architecture, an approach that conserves resources by using only the parts of a model that are needed for a given task. The company plans to discuss future plans for AI models and products at its LlamaCon conference, which is taking place on April 29th.

    As with its past models, Meta calls the Llama 4 collection “open-source,” although Llama has been criticized for its license restrictions. For instance, the Llama 4 license requires commercial entities with more than 700 million monthly active users to request permission from Meta before using its models, which the Open Source Initiative wrote in 2023 takes it “out of the category of ‘Open Source.’”