Jetson Generative AI – AudioCraft

Audio generation reaches new creative heights with Meta’s AudioCraft—a powerful suite of models that can produce high-quality audio and music from text descriptions in real time on your Jetson device.
Requirements
Hardware / Software |
Notes
|
---|---|
Jetson AGX Orin (64GB)
|
Recommended for best performance
|
Jetson AGX Orin (32GB)
|
Good performance for most use cases
|
Jetson Orin Nano (8GB)
|
Minimum requirement
|
JetPack 5 (L4T r35.x) |
Required for compatibility
|
NVMe SSD highly recommended
|
For storage speed and space
|
10.7 GB for audiocraft container
|
Container image storage
|
Additional space for checkpoints
|
Model weights and generated audio
|
Note: AudioCraft can generate both environmental sounds and music, making it versatile for creative projects, sound design, and music production.
Step-by-Step Setup
1. Clone jetson-containers repository
2. Install jetson-containers
3. Launch AudioCraft container
Use the run script and autotag to automatically pull or build a compatible container:
The container has a default run command that automatically starts the Jupyter Lab server.
4. Access Jupyter Lab interface
Open your browser and navigate to:
http://:8888

AudioCraft Jupyter Interface
5. Navigate to Demos
In the Jupyter Lab navigation pane on the left, double-click the `demos` folder to access the AudioCraft demonstration notebooks.
AudioGen Demo
Getting Started
Audio Continuation
Generate audio continuation based on existing audio and text descriptions:
Generated Audio Example:
Text-Conditional Generation
Generate audio directly from text descriptions:
Generated Audio Example:
Tip: Check the AudioGen notebook for more text-conditional generation examples and different audio categories.
MAGNeT Demo
Text-to-Music Generation
Generated Audio Example:
Tip: Explore the MAGNeT notebook for additional music generation examples and different genre prompts.
Advanced Text Prompting
Generated Audio Example:
Tip: See the MAGNeT notebook for more advanced prompting techniques and parameter customization options.
MusicGen Demo
Music Continuation
Generated Audio Example:
Tip: Visit the MusicGen notebook for more music continuation examples and different musical styles.
Musicgen Web Interface
AudioCraft also provides a user-friendly Gradio-based web interface for easy experimentation without coding.
http://<jetson-ip>:7860/

AudioCraft Gradio Interface
The Gradio interface provides:
Troubleshooting
Issue
|
Fix
|
---|---|
Triton warning message
|
Safe to ignore, models will work without Triton optimizations
|
Model download fails
|
Check internet connection and available disk space (>15GB)
|
Out of memory errors
|
Use smaller models, reduce batch size, restart kernel
|
Audio playback issues
|
Ensure browser supports audio playback, check volume settings
|
Slow generation
|
Use smaller models, reduce audio duration, check GPU memory
|
Container fails to start
|
Ensure jetson-containers is properly installed
|
For more information about AudioCraft and advanced configurations, visit the AudioCraft GitHub repository and jetson-containers documentation.