Deccan AI, an AI data and evaluation startup, has raised $25 million in a funding round led by A91 Partners. The round also ...
Researchers have identified key components in large language models (LLMs) that play a critical role in ensuring these AI ...
The focus of artificial-intelligence spending has gone from training models to using them. Here’s how to understand the ...
Have you ever found yourself deep in the weeds of training a language model, wishing for a simpler way to make sense of its learning process? If you’ve struggled with the complexity of configuring ...
OpenAI released a new base model on Thursday called GPT-4.5, which the company said is its best and smartest model for chat yet. It’s not a reasoning model like OpenAI’s o1 and o3 models, but it can ...
Many companies are learning that keeping their AI safe is about more than just adding some cloud security as a makeshift gate ...
Nvidia's Nemotron-Cascade 2 is a 30B MoE model that activates only 3B parameters at inference time, yet achieved gold ...
Mark Stevenson has previously received funding from Google. The arrival of AI systems called large language models (LLMs), like OpenAI’s ChatGPT chatbot, has been heralded as the start of a new ...
A new academic study challenges a core assumption in developing large language models (LLMs), warning that more pre-training data may not always lead to better models. Researchers from some of the ...
A Chinese AI company's more frugal approach to training large language models could point toward a less energy-intensive—and more climate-friendly—future for AI, according to some energy analysts. "It ...