Tech News

Nvidia becomes a major model maker with nemotron 3

Nvidia did It is fortunate to provide chips to companies working in artificial intelligence, but today the chipmaker took the step to become a more important model itself by releasing a series of cutting-edge models, as well as data and tools to help engineers use them.

The move, which comes at a moment when AI companies like Opelai, Google and Anthropic are developing chips for their abilities, could be a hedge against companies from Nvidia’s technology in the long run.

Open models are already an important part of the AI ​​ecosystem with many researchers and startups using them to test, prototype and build. While Opelai and Google offer small open models, they don’t update them as often as their competitors in China. For this reason and others, open models from Chinese companies are currently very popular, according to information from the face of the hugging, hosting platform for open source projects.

NVIDIA’s new Nemotron 3 models are among the best that can be downloaded, converted, and run on your hardware, according to Harchmark Spores shared by the company before release.

“Innovation is at the core of AI development,” CEO Jensen Huang said in a statement ahead of the news. “With Nemotron, we’re turning advanced AI into an open platform that gives developers the clarity and efficiency they need to build agentic systems at scale.”

Nvidia has taken a more visible approach than many of its US competitors by releasing the data used to train the nemotron – a fact that should help developers change models more easily. The company also releases tools to help with customization and optimization. This includes a new hybrid of the hybrid of the hybrid of the professional model, means that Nvidia says that it is very good at creating Agents ai that can take action on computers or through the web. The company also introduces libraries that allow users to train agents to do things using learning materials, including providing reward and punishment models.

Nemotron 3 models come in three sizes: Nano, with 30 billion parameters; Super, with 100 billion; and Ultra, with 500 billion. The parameters of the model are closely related to how it knows it is appropriate and how it should work. The largest models are so robust that they need to run on expensive hardware racks.

Basics of the model

Kari Ann Briski, Vice President of Enterprise AI Software at Nvidia, said open models are important to AI developers for three reasons: Developers increasingly need to customize models for specific tasks; It often helps to ask questions in different models; And it’s easy to narrow down the most intelligent responses from these types after training yourself to have them do the same type of thinking. “We believe that open source is the foundation for innovation, to continue to accelerate the global economy,” Briski said.

Social Media Giant Meta released the first advanced models under the name LLama in February 2023. As competition has intensified, however, Meta has signaled that its future releases may not be open source.

Mobility is part of a larger trend in the AI ​​industry. In the past year, US firms have moved away from openness, becoming more secretive about their research and more reluctant to confront their latest rivals.

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to top button