Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Downloading AudioCaps data #36

Open
fyell opened this issue Oct 12, 2023 · 2 comments
Open

Downloading AudioCaps data #36

fyell opened this issue Oct 12, 2023 · 2 comments

Comments

@fyell
Copy link

fyell commented Oct 12, 2023

Hi,

I'm trying to download the AudioCaps data in order to train the Tango model. However, I'm not seeing any instructions in the AudioCaps repository on how to download it. Can you share any scripts or instructions on how to download and format the audio to train Tango?

Thanks!

@deepanwayx
Copy link
Collaborator

You need to use something like youtube-dl to download the audio files from youtube.

Otherwise, you may want to download the WavCaps dataset and extract the audios from the zip files. This will include AudioCaps among several other datasets.

The dataset already provides a ChatGPT-generated caption for each audio file, but you can probably map the audio files to the original AudioCaps captions using the filenames.

@xzm-whq
Copy link

xzm-whq commented Dec 23, 2024

你好

我正在尝试下载 AudioCaps 数据以训练 Tango 模型。但是,我在 AudioCaps 存储库中没有看到任何有关如何下载它的说明。您能否分享有关如何下载和格式化音频以训练 Tango 的任何脚本或说明?

谢谢!

Hello, I have also encountered this problem, please how is your dataset downloaded

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants