Skip to content

Commit

Permalink
Version 0.5.2
Browse files Browse the repository at this point in the history
  • Loading branch information
Labbeti committed Mar 23, 2024
1 parent 650057f commit 2a1198c
Show file tree
Hide file tree
Showing 19 changed files with 8,154 additions and 40 deletions.
2 changes: 1 addition & 1 deletion .pre-commit-config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@
repos:
# Format Code
- repo: https://github.com/ambv/black
rev: 22.10.0
rev: 23.3.0
hooks:
- id: black

Expand Down
4 changes: 4 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,10 @@

All notable changes to this project will be documented in this file.

## [0.5.2] 2024-03-23
### Added
- `freesound_no_clotho_v2` subset to WavCaps to avoid all bias with Clotho test and analysis subsets.

## [0.5.1] 2024-03-04
### Fixed
- WavCaps download preparation (#3).
Expand Down
6 changes: 3 additions & 3 deletions CITATION.cff
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ title: aac-datasets
message: 'If you use this software, please cite it as below.'
type: software
authors:
- given-names: Etienne
- given-names: Étienne
family-names: Labbé
email: [email protected]
affiliation: IRIT
Expand All @@ -22,5 +22,5 @@ keywords:
- captioning
- audio-captioning
license: MIT
version: 0.5.1
date-released: '2024-03-04'
version: 0.5.2
date-released: '2024-03-23'
2 changes: 1 addition & 1 deletion LICENSE
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
MIT License

Copyright (c) 2022 Etienne Labbé (Labbeti)
Copyright (c) 2022 Étienne Labbé (Labbeti)

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
Expand Down
10 changes: 5 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -74,14 +74,14 @@ aac-datasets-download --root "." clotho --subsets "dev"
| Subsets | `train`, `val`, `test` | `dev`, `val`, `eval`, `dcase_aac_test`, `dcase_aac_analysis`, `dcase_t2a_audio`, `dcase_t2a_captions` | `full` | `as`, `as_noac`, `bbc`, `fsd`, `fsd_nocl`, `sb` |
| Sample rate (kHz) | 32 | 44.1 | 48 | 32 |
| Estimated size (GB) | 43 | 53 | 13 | 941 |
| Audio source | AudioSet | FreeSound | TAU Urban Acoustic Scenes 2019 | AudioSet, BBC Sound Effects, FreeSound, SoundBible | -->
| Audio source | AudioSet | Freesound | TAU Urban Acoustic Scenes 2019 | AudioSet, BBC Sound Effects, Freesound, SoundBible | -->

| Dataset | Sampling<br>rate (kHz) | Estimated<br>size (GB) | Source | Subsets |
|:---:|:---:|:---:|:---:|:---:|
| AudioCaps | 32 | 43 | AudioSet | `train`<br>`val`<br>`test`<br>`train_v2` |
| Clotho | 44.1 | 53 | Freesound | `dev`<br>`val`<br>`eval`<br>`dcase_aac_test`<br>`dcase_aac_analysis`<br>`dcase_t2a_audio`<br>`dcase_t2a_captions` |
| MACS | 48 | 13 | TAU Urban Acoustic Scenes 2019 | `full` |
| WavCaps | 32 | 941 | AudioSet<br>BBC Sound Effects<br>FreeSound<br>SoundBible | `as`<br>`as_noac`<br>`bbc`<br>`fsd`<br>`fsd_nocl`<br>`sb` |
| WavCaps | 32 | 941 | AudioSet<br>BBC Sound Effects<br>Freesound<br>SoundBible | `audioset`<br>`audioset_no_audiocaps`<br>`bbc`<br>`freesound`<br>`freesound_no_clotho`<br>`freesound_no_clotho_v2`<br>`soundbible` |

For Clotho, the **dev** subset should be used for training, val for validation and eval for testing.

Expand Down Expand Up @@ -177,16 +177,16 @@ If you use this software, please consider cite it as "Labbe, E. (2013). aac-data
```
@software{
Labbe_aac_datasets_2024,
author = {Labbé, Etienne},
author = {Labbé, Étienne},
license = {MIT},
month = {03},
title = {{aac-datasets}},
url = {https://github.com/Labbeti/aac-datasets/},
version = {0.5.1},
version = {0.5.2},
year = {2024}
}
```

## Contact
Maintainer:
- Etienne Labbé "Labbeti": [email protected]
- [Étienne Labbé](https://labbeti.github.io/) "Labbeti": [email protected]
Loading

0 comments on commit 2a1198c

Please sign in to comment.