MCP Reference - Audio Console

← Back to Home

What is MCP? MCP (Model Context Protocol) is a protocol that allows AI assistants like ChatGPT to interact with external tools and services. Audio Console provides a set of MCP tools for audio editing, conversion, and analysis.

How to use: These tools are automatically available when Audio Console is configured as an MCP server. In ChatGPT and other MCP-compatible clients, you can ask the AI to perform audio operations, and it will use these tools automatically.

UI Tools: Some tools open an interactive UI widget within ChatGPT, allowing you to visually edit audio with waveform visualization, trim controls, and fade effects. Other tools perform operations directly without opening a UI.

Quickstart Guide

Step 1: Get the MCP Server URL

The Audio Console MCP server is available at:

https://www.audioconsole.app/mcp

Step 2: Configure Your MCP Client

Add Audio Console to your MCP client configuration. For ChatGPT, add the following to your MCP settings:

{
  "mcpServers": {
    "audio-console": {
      "url": "https://www.audioconsole.app/mcp"
    }
  }
}
                

Step 3: Start Using the Tools

Once configured, you can ask ChatGPT or your MCP client to perform audio operations. For example:

"Open the audio editor with this file"
"Convert this audio to MP3"
"Trim the first 30 seconds of this audio"
"Separate the vocals from this song"
"Detect the BPM and key of this track"

Step 4: Explore the Tools

Browse the tools below to see all available operations. Tools marked with MCP + UI will open an interactive editor in ChatGPT, while others perform operations directly.

Tools with UI MCP + UI

These tools open an interactive widget within ChatGPT and other MCP clients that support UI rendering. They provide visual waveform editing, trim controls, and real-time preview.

audio.open_audio_editor

Open Audio Editor

Opens an interactive audio editor widget for trimming, fading, and format conversion. Supports MP3, WAV, FLAC, OGG, M4A, and M4R formats. Use this for custom trim points, fade adjustments, or when you want visual waveform editing. The editor provides dual-track processing for vocals and music separation.

Parameters:

audioFile

Object (optional)

Audio file uploaded by the user in the chat. Contains download_url and file_id. Use this when the user attaches an audio file.

audioUrl

String URL (optional)

Public HTTPS URL to an audio file. Use this when the user provides a direct link to an audio file.

audio.open_ringtone_editor

Open Ringtone Editor

Opens a specialized ringtone editor widget optimized for creating ringtones. Same UI as the audio editor but with format options optimized for iPhone (M4R) and Android (OGG). Use this specifically for ringtones. Best for audio clips 30 seconds or less.

Parameters:

audioFile

Object (optional)

Audio file uploaded by the user in the chat. Contains download_url and file_id. Use this when the user attaches an audio file.

audioUrl

String URL (optional)

Public HTTPS URL to an audio file. Use this when the user provides a direct link to an audio file.

Direct Operation Tools

These tools perform operations directly without opening a UI. They process audio and return results immediately. Perfect for automated workflows and batch processing.

audio.convert_from_url

Convert Audio Format

Converts an audio file from a public URL to a different format without editing. Supported formats: MP3, WAV, FLAC, OGG, M4A, M4R. Use this for simple format conversion only. For editing, use audio.open_audio_editor instead.

Parameters:

audioUrl

String URL (required)

Public HTTPS URL of the audio to convert. Example: https://cdn.example.com/audio/song.wav

format

Enum (required)

Target format: mp3, wav, flac, ogg, m4a, m4r

trackName

String (optional, max 80 chars)

Optional display name for the exported file. Example: Session_Mix

audio.trim_start_of_audio

Trim Start of Audio

Extracts exactly the first 30 seconds of an audio file with automatic fade in/out effects. Use this for quick intro extraction. Requires audio to be at least 30 seconds long. For custom trim points, use audio.open_audio_editor instead.

Parameters:

audioUrl

String URL (required)

Public HTTPS URL of the audio file. Example: https://cdn.example.com/audio/song.mp3

format

Enum (optional)

Target audio format. Supported: mp3, wav, flac, ogg, m4a, m4r. Defaults to mp3.

trackName

String (optional, max 80 chars)

Optional display name for the exported file. Example: Intro_30s

audio.trim_end_of_audio

Trim End of Audio

Extracts exactly the last 30 seconds of an audio file with automatic fade in/out effects. Use this for quick outro extraction. Requires audio to be at least 30 seconds long. For custom trim points, use audio.open_audio_editor instead.

Parameters:

audioUrl

String URL (required)

Public HTTPS URL of the audio file. Example: https://cdn.example.com/audio/song.mp3

format

Enum (optional)

Target audio format. Supported: mp3, wav, flac, ogg, m4a, m4r. Defaults to mp3.

trackName

String (optional, max 80 chars)

Optional display name for the exported file. Example: Outro_30s

audio.separate_voice_from_music

Separate Voice from Music

Separates vocals from music and returns both tracks (vocals and instrumental). Use this when you explicitly want both separated tracks. Works best with music that has clear vocal and instrumental separation. Uses LALAL.AI for high-quality separation.

Parameters:

audioFile

Object (optional)

Audio file uploaded by the user in the chat. Contains download_url and file_id.

audioUrl

String URL (optional)

Public HTTPS URL of the audio file to separate. Either audioFile or audioUrl must be provided.

trackName

String (optional, max 80 chars)

Optional display name for the output files. Example: My_Song

audio.remove_vocals

Remove Vocals

Removes vocals from audio and returns only the instrumental/background music track. Use this when you explicitly want instrumental only. Works best with music that has clear vocal and instrumental separation.

Parameters:

audioFile

Object (optional)

Audio file uploaded by the user in the chat. Contains download_url and file_id.

audioUrl

String URL (optional)

Public HTTPS URL of the audio file. Either audioFile or audioUrl must be provided.

trackName

String (optional, max 80 chars)

Optional display name for the output file. Example: My_Song_Instrumental

audio.extract_vocals

Extract Vocals

Extracts vocals from audio and returns only the vocal track without music. Use this when you explicitly want vocals only. Works best with music that has clear vocal and instrumental separation.

Parameters:

audioFile

Object (optional)

Audio file uploaded by the user in the chat. Contains download_url and file_id.

audioUrl

String URL (optional)

Public HTTPS URL of the audio file. Either audioFile or audioUrl must be provided.

trackName

String (optional, max 80 chars)

Optional display name for the output file. Example: My_Song_Vocals

audio.detect_bpm_and_key

Detect BPM and Key

Analyzes audio to detect BPM (beats per minute/tempo) and musical key. Use this for music analysis, DJ mixing, or music production purposes. This tool only provides analysis information - it does not edit or convert audio. Works best with music tracks that have clear rhythm and harmonic content.

Parameters:

audioFile

Object (optional)

Audio file uploaded by the user in the chat. Contains download_url and file_id.

audioUrl

String URL (optional)

Public HTTPS URL of the audio file to analyze. Either audioFile or audioUrl must be provided.

audio.notify_download_link_ready

Share Generated Audio Link

Notifies ChatGPT about audio download links. This tool is typically called automatically by the widget after processing completes. Use this only after the file upload is complete. Primarily used internally by the audio editor widget.

Parameters:

downloadUrl

String URL (required)

Public HTTPS URL where the generated audio can be downloaded. Example: https://downloads.example.com/audio/final.mp3

fileName

String (optional, max 120 chars)

Suggested file name shown to the user. Example: Final.mp3

format

Enum (optional)

Target audio format. Supported: mp3, wav, flac, ogg, m4a, m4r