# Madverse Music: AI Audio Classifier - Usage Guide ## Quick Start ### Option 1: Hugging Face Space (Recommended) Use our deployed model on Hugging Face Spaces: **Web Interface:** 1. Go to the Hugging Face Space URL 2. Upload your audio file 3. Click "Analyze Audio" 4. Get instant results **API Access:** ```bash # Health check curl https://your-space-name.hf.space/health # Analyze audio file curl -X POST "https://your-space-name.hf.space/analyze" \ -F "file=@your-song.mp3" ``` ### Option 2: Local Setup ```bash # Install dependencies pip install -r requirements.txt # Start the API server python api.py # Or start web interface streamlit run app.py ``` ## Supported Audio Formats - WAV (.wav) - MP3 (.mp3) - FLAC (.flac) - M4A (.m4a) - OGG (.ogg) ## API Usage ### Hugging Face Space API #### Health Check ```bash GET /health ``` #### Analyze Audio ```bash POST /analyze ``` Upload audio file using multipart/form-data **Request:** Upload file using form data with field name "file" **Response Format:** ```json { "classification": "Real", "confidence": 0.85, "probability": 0.15, "raw_score": -1.73, "duration": 30.5, "message": "Detected as real music" } ``` ### Usage Examples #### Python ```python import requests # Upload file to HF Space with open('your_song.mp3', 'rb') as f: response = requests.post('https://your-space-name.hf.space/analyze', files={'file': f}) result = response.json() print(result) ``` #### JavaScript ```javascript const formData = new FormData(); formData.append('file', fileInput.files[0]); const response = await fetch('https://your-space-name.hf.space/analyze', { method: 'POST', body: formData }); const result = await response.json(); ``` ## Understanding Results The classifier will output: - **"Real"** = Human-created music - **"Fake"** = AI-generated music (from Suno, Udio, etc.) ### API Response Format: ```json { "classification": "Real", "confidence": 0.85, "probability": 0.15, "raw_score": -1.73, "duration": 30.5, "message": "Detected as real music" } ``` ### Command Line Output: ``` Analyzing: my_song.wav Result: Fake (AI-generated music) Confidence: 0.96 | Raw output: 3.786 ``` ## Model Specifications - Model: SpecTTTra-α (120 seconds) - Sample Rate: 16kHz - Performance: 97% F1 score, 96% sensitivity, 99% specificity - Max Duration: 120 seconds (2 minutes) ## Technical Details ### How It Works: 1. Audio is loaded and resampled to 16kHz 2. Converted to mel-spectrograms 3. Processed by the SpecTTTra transformer model 4. Output logit is converted to probability using sigmoid 5. Classification: `prob < 0.5` = Real, `prob ≥ 0.5` = Fake ### Testing Your Music 1. Get AI-generated samples: Download from Suno, Udio, or other AI music platforms 2. Get real music samples: Use traditional human-created songs 3. Run the classifier: Compare results to see how well it detects AI vs human music ## Expected Performance - High accuracy on detecting modern AI-generated music - Works best with full songs (up to 120 seconds) - Optimized for music from platforms like Suno and Udio Note: This model was trained specifically for detecting AI-generated songs, not just AI vocals over real instrumentals. It analyzes the entire musical composition.