Aiming to be an OpenAI Alternative! SuperNova: A Customizable, Instruction-Following Large Language Model for Enterprises

AIbase基地

Published inAI News · 11 min read · Sep 11, 2024

191

Arcee AI today introduced SuperNova, a 700-billion-parameter language model designed for enterprise deployment with advanced instruction-following capabilities and full customization options. The model aims to provide a robust, ownable alternative for enterprise data privacy, model stability, and customization, addressing key issues with API-based services such as OpenAI and Anthropic.

In the AI landscape dominated by cloud APIs, Arcee AI has taken a different approach with SuperNova. This large language model (LLM) can be deployed and customized within a company's own infrastructure. SuperNova was launched today, based on Meta's Llama-3.1-70B-Instruct architecture, and incorporates Arcee's superior instruction-following capabilities and a novel post-training process tailored to specific business needs.

Technological Innovation

SuperNova's development involved a multifaceted post-training approach.

Project lead engineer Lucas Atkins revealed the secret: "We trained three models: one distilled from Llama405B, one fed with our EvolKit-generated dataset, and one deeply modified with DPO on Llama3instruct. Finally, we combined them with a new magic, preserving the superpowers of each model."

Arcee claims this has resulted in SuperNova's instruction-following abilities, particularly from the distillation of the 405B parameter model, which not only shows SuperNova's ability to capture the essence of larger models while maintaining moderate hardware deployment.

Enterprise Deployment and Customization

SuperNova is designed to be deployed in the enterprise's own cloud environment, initially available on AWS Marketplace. Arcee is also working to make it available on Google and Azure marketplaces.

Arcee AI's co-founder Mark McQuade emphasized the benefits of this deployment method: "The model is deployed in your AWS VPC, but it also launches a web server, a chat interface, and a database to store your chat history. Everyone in your organization can interact closely with it."

This deployment method addresses concerns about data privacy and model stability. Unlike API-based services that may change without notice, SuperNova gives enterprises complete control. McQuade noted, given the recent turmoil in the AI industry, this is particularly important: "OpenAI just abandoned 3.5... many companies built their businesses around the 3.5 API. So when that API changes, your application crashes. But in our world, nothing changes unless you want it to, because it's your model, your way of running it."

Customization and Continuous Improvement

A major selling point of SuperNova is its ability to be fine-tuned and retrained in the enterprise environment.

Atkins explained the process and its benefits: "Over time, we can retrain the model entirely in your own environment to better align with your preferences. As we save these chats, if you want the model to improve comprehensively according to your unique preferences as a business, we have the ability to do so without letting data leave your system."

This capability allows technical teams to adapt the model to specific domain knowledge or company-specific requirements, a significant advantage over cloud-based API services that typically do not allow such customization.

Open Source Components

While the full 70B model is not open source, Arcee is releasing several components for the developer community:

A free API for testing and evaluation: This allows developers to try SuperNova without committing to full deployment.

SuperNova-Lite: An 8B parameter open-source version model. This smaller model may be useful for developers working in resource-constrained environments or those wanting to understand the architecture before deploying the full model.

EvolKit: Their dataset generation pipeline for creating complex QA pairs. This tool may be valuable for organizations looking to create custom training data for their specific use cases.

By open-sourcing these components, Arcee contributes to a broader AI community while also providing potential customers with tools to evaluate and customize their products. Arcee SuperNova is also available on AWS Marketplace.

Performance Claims and Benchmarking

Arcee claims SuperNova performs well across various domains, especially in mathematical reasoning. "Atkins noted: 'This performs exceptionally well on mathematical benchmarks.'" However, the company encourages third-party evaluations to validate their claims.

"We will provide an API for people to use. If third parties want to run trusted benchmarks to evaluate it themselves, we can arrange to provide them with access to the weights. We want full transparency for this model," Atkins said.

This openness to third-party evaluation is commendable as it allows independent verification of Arcee's claims. It will be particularly interesting to see how SuperNova performs against leading AI company models like OpenAI, Anthropic, etc., on standard benchmarks.

Impact on Enterprise AI Strategies

The launch of SuperNova comes at a time when many enterprises are reevaluating their AI strategies. While cloud-based API services have dominated the field, there is growing interest in deployable, customizable models that offer more control and flexibility.

SuperNova's approach addresses several key issues:

Data Privacy: By deploying within the company's own infrastructure, SuperNova ensures sensitive data never leaves the organization's control.
Model Stability: Unlike API services that may change or be deprecated without notice, SuperNova provides a stable foundation that only changes when the organization chooses to update.
Customization: The ability to fine-tune and retrain the model on company-specific data allows for deep customization not possible with most API services.
Cost Control: While the initial deployment may require significant resources, the long-term costs of running SuperNova may be lower than paying for large-scale API calls.
Competitive Advantage: A customized, continuously improving AI model can provide a significant competitive advantage in industries where AI-driven insights are crucial.

The AI Sovereignty Dilemma

As enterprises navigate the rapidly evolving AI landscape, the launch of SuperNova reveals growing tensions in the industry: the convenience and capabilities of cloud-based AI services versus the control and customization offered by deployable models. This dichotomy presents what we might call the "AI Sovereignty Dilemma."

On one hand, cloud-based API services like GPT-4 and Claude offer state-of-the-art performance and continuous updates, but at the cost of data privacy issues and limited customization. On the other hand, models like SuperNova promise complete control and customization but require internal expertise for deployment and maintenance.

Arcee's approach with SuperNova attempts to bridge this gap, offering a model that can be deployed locally while still providing capabilities intended to rival leading cloud-based services. This hybrid approach may be particularly appealing to industries with strict regulatory requirements or those dealing with highly sensitive data.

Official Blog: https://blog.arcee.ai/meet-arcee-supernova-our-flagship-70b-model-alternative-to-openai/

Google Launches New Veo 3 Video Generation Model Globally

Google announced the global launch of its latest video generation model, Veo3. This long-anticipated release has generated great excitement among users, as Veo3 is now available to Gemini users in over 159 countries, offering a new video creation experience. The key feature of the Veo3 video generation model is its ability to generate videos up to eight seconds long based on simple text prompts. According to Google, this technology is designed for creative users, especially those on social media who increasingly demand short-form content.

DeepMind introduces Crome: Enhancing the Alignment of Large Language Models with Human Feedback

In the field of artificial intelligence, reward models are a critical component for aligning large language models (LLMs) with human feedback, but existing models face the issue of "reward hacking." These models often focus on superficial features, such as the length or format of responses, rather than identifying genuine quality metrics, such as factual accuracy and relevance. The root cause lies in standard training objectives failing to distinguish between spurious associations and true causal drivers present in the training data. This failure leads to fragile reward models (RMs), which generate misaligned policies.

Kunlun Xiwang Once Again Open-Sources the Reward Model Skywork-Reward-V2

On July 4, 2025, Kunlun Xiwang continued to open-source the second-generation reward model Skywork-Reward-V2 series. This series includes 8 reward models based on different foundation models, with parameter sizes ranging from 600 million to 8 billion. Upon its release, it won all seven major reward model evaluation rankings, becoming a focus in the open-source reward model field. Reward models play a key role in the reinforcement learning from human feedback (RLHF) process. To build the next generation of reward models, Kunlun Xiwang has constructed a dataset containing 40 million

China's Medical Large Model Release Volume Accounts for 70% of the Global Total! KPMG Reveals Future Market Potential

According to KPMG China's recent report, "The First 50 Health Tech Companies," China accounts for more than 70% of the global release volume of medical large models. This data not only demonstrates China's rapid development in the field of intelligent healthcare, but also reflects the wide application of large language models in the healthcare industry. The report points out that about 65% of the currently released medical large models are large language models. These models can process and generate natural language, playing a significant supporting role in the analysis of medical data, patient communication, and scientific research.

New Developments in OpenAI Copyright Lawsuit: The New York Times Will Have Access to Deleted User Data

In the long-standing copyright infringement lawsuit filed by The New York Times against OpenAI, the case has made significant progress. According to Ars Technica, the federal judge presiding over the case has authorized The New York Times and its co-plaintiffs, The New York Daily News and the Investigative Reporting Center, to access OpenAI's user logs, including deleted content, to accurately determine the scope of the infringement. The New York Times believes that ChatGPT users may delete their history after bypassing the paywall, and therefore it is necessary to conduct large-scale data collection.

Xiaopeng G7 Ultra Makes a Grand Debut! Revolutionary Intelligent Driving Large Model Unveiled

In the new energy vehicle market, Xiaopeng Automotive has once again drawn attention. On July 3rd, the Xiaopeng G7 Ultra was officially launched, becoming the first intelligent vehicle equipped with the local-end "VLA+VLM" large model. This innovative technology marks an important step forward for Xiaopeng in the field of intelligent driving. The Xiaopeng G7 Ultra is equipped with the VLA (active thinking and rapid decision-making capability) large model, making the driving experience more intelligent. In daily driving, the G7 Ultra can flexibly handle various complex driving scenarios, such as in traffic.

Shortcut Makes Its Debut! AI Excel Assistant Surpasses Human Champions by 10 Times, Task Automation Efficiency Soars

Recently, an AI Excel assistant called Shortcut has sparked heated discussions on social media. It enables users to effortlessly complete Excel tasks without writing complex formulas or VBA code through natural language processing (NLP) technology. The AIbase editorial team has compiled the latest information from social media to provide an in-depth analysis of Shortcut's powerful features and its potential impact on the fields of data processing and financial modeling. Shortcut: An Excel Revolution Driven by Natural Language

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

Aiming to be an OpenAI Alternative! SuperNova: A Customizable, Instruction-Following Large Language Model for Enterprises

AIbase基地

This article is from AIbase Daily

AI News Recommendations