GitHub - youngjuene/microvar: Official PyTorch implementation of Micro-variation of Sound Objects Using Component Separation and Diffusion Models (ICMC 2023)

Micro-variation of Sound Objects Using Component Separation and Diffusion Models

Official PyTorch implementation of Micro-variation of Sound Objects Using Component Separation and Diffusion Models (ICMC 2023).

Requirements

Create conda environment

conda create -n microvar python=3.8 -y
conda activate microvar
conda env update -f environment.yaml

Place the desired audio dataset in data directory and preprocess it as follows

cd mvd
python segment_audio.py --audio_dir {directory of original dataset}
python preprocess.py --audio_dir {directory with segmented audio files} --sep {separation options}

Train the model on custom datasets Run the train.py script to train the model.
Replace {model name} with the desired name for your model, and {directory with preprocessed audio files} with the path to the preprocessed audio files.

python train.py --model_dir {model name} --data_dirs {directory with preprocessed audio files}

Generate samples using the model checkpoints Run the generate.py script to generate samples using the trained model.
Replace {model name} with the name of directory where your trained model is located, {input audio} with the name of the input audio file and {output audio filename} with the desired name for the output audio file.

python generate.py --ckpt_dir {ckpt dir} -i {input audio} -o {output audio filename}

Demo Using Pretrained Models

Download model checkpoints (WIP)

wget https://zenodo.org/record/00000/files/mvd.tar.gz
tar -zxvf mvd.tar.gz

Run generate.py by specifying the pretrained checkpoint

python generate.py --ckpt_path {pretrained ckpt} -i {input audio} -o {output audio filename}

Try out other examples on Max/MSP and Unreal Engine
Download the project files Link

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

Citation

Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follow.

@article{micro2023liu,
  title={Micro-variation of Sound Objects Using Component Separation and Diffusion Models},
  author={a, b, c},
  proceedings={International Computer Music Conference},
  year={2023}
}

TODO

Check volume issue
Upload downloadable project files
...

Reference

DiffWave: A Versatile Diffusion Model for Audio Synthesis

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
data		data
mvd		mvd
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Micro-variation of Sound Objects Using Component Separation and Diffusion Models

Requirements

Demo Using Pretrained Models

License

Citation

TODO

Reference

About

Releases

Packages

Languages

License

youngjuene/microvar

Folders and files

Latest commit

History

Repository files navigation

Micro-variation of Sound Objects Using Component Separation and Diffusion Models

Requirements

Demo Using Pretrained Models

License

Citation

TODO

Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages