New API - AI Model Serving Tool

Overview

New API is an open-source, next-generation LLM gateway and AI asset management system that standardizes access to multiple large model APIs. It provides a rich UI, multi-language support, online recharge, usage tracking, token grouping, model charging, and configurable reasoning effort for personal and enterprise internal management and distribution.

Key Features

  • Standardized interface for multiple large model APIs such as OpenAI and Claude
  • Rich web-based user interface
  • Multi-language support
  • Online recharge and billing controls
  • Usage tracking and token grouping
  • Model-level charging configuration
  • Configurable reasoning effort per model or request
  • Designed for personal and enterprise internal distribution

Ideal Use Cases

  • Unify calls to multiple LLM providers for applications
  • Centralize internal AI assets and model access control
  • Track and allocate usage and costs across teams
  • Offer paid access or charging per model or request
  • Provide a configurable gateway for enterprise deployments

Getting Started

  • Open the GitHub repository: https://github.com/QuantumNous/new-api
  • Clone or download the repository locally
  • Review the project's README and configuration documentation
  • Configure external model provider API keys and mappings
  • Deploy or run the provided web UI
  • Enable online recharge, billing, and usage tracking features
  • Test endpoints and verify token and usage reporting

Pricing

Not disclosed in source; repository is available on GitHub.

Key Information

  • Category: Model Serving
  • Type: AI Model Serving Tool