YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)
MULE-PyTorch
This is a PyTorch reimplementation of the MULE model for music audio embeddings.
Usage
You can use this model directly from Hugging Face:
from transformers import AutoModel
# Load the model (downloads automatically)
model = AutoModel.from_pretrained("oriyonay/mule-pytorch", trust_remote_code=True)
# Get embeddings for your audio
embeddings = model.extract_embeddings("your_audio.wav")
print(embeddings.shape)
Implementation Details
- Architecture: SfNfNetF0 (Slow-Fast Normalizer-Free Network).
- Weights: Converted from the original TensorFlow 2.x Keras model.
- Preprocessing: Log-Mel Spectrogram (SR=16k, N_MELS=128).
Porting Process
The original model was implemented in TensorFlow 2.x using custom layers for Weight Standardization, Stochastic Depth, and Scalar Multiplication. This port re-implements those layers in PyTorch and maps the weights to a Hugging Face compatible structure.
Credits
Original implementation by Pandora Media, LLC. PyTorch port by Gemini.
- Downloads last month
- 162
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support