<!– |
MONYC |
Music of New York City (MONYC) is a dataset of 1.5k music clips as recorded by the sensors of the Sounds of New York City (SONYC) project –> |
Open-set Tagging (OST) |
A synthetic dataset of 500k 1s clips with controlled polyphony and seen/unseen class assignments to investigate source-centric audio representation learning and open-set audio classification. |
|
FSD-MIX-CLIPS |
A dataset of 614,533 programmatically mixed 1-s audio clips with a controlled level of polyphony and signal-to-noise ratio |
|
FSD-SED |
A dataset of 281,039 programmatically mixed 10-s strongly-labeled audio clips with a controlled level of polyphony and signal-to-noise ratio |
|
SONYC-Backgrounds |
A dataset of 441 recordings of urban background noise obtained from the SONYC acoustic sensor network |
|
SNUSS |
Synthetic noisy urban soundscapes (SNUSS) is a dataset of 30,000 synthetic soundscapes with real urban background noise meant to mimic urban soundscapes |
|
SONYC-UST-V2 |
An Urban Sound Tagging Dataset with Spatiotemporal Context |
|
SONYC-UST |
SONYC Urban Sound Tagging (SONYC-UST): a multilabel dataset from an urban acoustic sensor network |
|
URBAN-SED |
10,000 synthesized soundscapes with strong annotations |
|
VocalSketch |
Vocal imitations of a large set of diverse sounds |
|
VimSketch |
Vocal imitations of an even larger set of diverse sounds (VocalSketch combined with Vocal Imitation Set) |
|
Tunebot |
A query-by-humming dataset with 10,000 sung contributions |
|