Conversational Speech dataset

Publicly available dataset of conversational speech audio recordings.
This CL includes the following:
- README.md: dataset description file, which also includes the scripts
- *.wav.sha1: hash files for each audio track in the dataset

The overall size of the wav files is ~36MB.
This dataset is primarily intended for use with the conversational speech tool (see https://chromium.googlesource.com/external/webrtc/+/master/webrtc/modules/audio_processing/test/conversational_speech/), which can create longer recordings with custom turn-switch timing.

BUG=webrtc:7218

Review-Url: https://codereview.webrtc.org/2869833002
Cr-Commit-Position: refs/heads/master@{#18068}
alessiob 2017-05-09 07:11:03 -07:00 committed by Commit bot
parent f9e5ebcac7
commit 77b376c094
67 changed files with 129 additions and 0 deletions

View File

@ -0,0 +1 @@
3be3b93e5d2d346dc234c73de9d70f3363477c98

View File

@ -0,0 +1 @@
17cc4500202ff01f99eaedbf0e48313bc6bc9ed2

View File

@ -0,0 +1 @@
22301b5ffcd00550e5a1c17d4475269b913e0a40

View File

@ -0,0 +1 @@
1cdedd87f100a0b152caef29dd8bf80458fa3bfc

View File

@ -0,0 +1 @@
35c35112547b87ae62470e71eb6ecb6c3ede6d6b

View File

@ -0,0 +1 @@
d2c4623a35cd9bd341d3f69e883ae552ba82128e

View File

@ -0,0 +1 @@
b08e20876c28d8f7b804121b0ada68bb979904fd

View File

@ -0,0 +1 @@
4e62ca314a3c509718c3bf9d4d274d7bfa9b1201

View File

@ -0,0 +1 @@
886c479f881fcf5ec1f89eb8d5f1d0aac906e1c8

View File

@ -0,0 +1 @@
b66c18917897b7ec189e18534eeead51656676ba

View File

@ -0,0 +1 @@
01f401a785ea59d293b9aa91720793ef246cda1d

View File

@ -0,0 +1 @@
0f33fc0072e8dd7be69a5e9ec64ae21d08d382b2

View File

@ -0,0 +1 @@
7774368c0285c2e37c3bee74fbc6b6287e48dc27

View File

@ -0,0 +1 @@
d75a37ce09456d64e8e764dabf4e1c5f62a3021e

View File

@ -0,0 +1 @@
91b128d52a8c2790b4d7ddcef617373e3e9a0fd3

View File

@ -0,0 +1 @@
bca4edcb8007d93185052493569c24cc4248f684

View File

@ -0,0 +1 @@
a6a0265063f9e7b1ba830a35dda2f05399dec3c1

View File

@ -0,0 +1 @@
d480ad07c11907ef985d2f726b783ed623945ab2

View File

@ -0,0 +1 @@
0060e174e4563bda2555bed1360e6e592d53385a

View File

@ -0,0 +1 @@
83e10191061c866177fc4ea846f4f6a96f76e7ea

View File

@ -0,0 +1 @@
cb14a4f9a3f7e01830d4de85830a379f1da5c7aa

View File

@ -0,0 +1 @@
eda398bd6c1e8df0c689df089f54e71c9db05c2e

View File

@ -0,0 +1 @@
18d61c3561eb3b66ce0937599f0e11ea0772a586

View File

@ -0,0 +1 @@
79c00d4487f8412cd18f735958deae625f7957ac

View File

@ -0,0 +1 @@
15506843d6a498334bb3dde70c891a069e7b907d

View File

@ -0,0 +1 @@
0ca6092950c59edd00a0b659c32c5f20f3a17076

View File

@ -0,0 +1 @@
5abaf53dc62f0e48e4b96cdfb89bc8e2ece358c5

View File

@ -0,0 +1 @@
03098c059e91ebc17fc4d24c665eeb5f5c61d0ad

View File

@ -0,0 +1 @@
8e02a66f1056dea5da74836637f929cee3eb7a25

View File

@ -0,0 +1 @@
d8951ec345c2df067bd205deb185cd264a4d5cb9

View File

@ -0,0 +1 @@
be92633ad4ffba1421d29ba50aa707620ea9816e

View File

@ -0,0 +1 @@
d1262e287cf97c41c2d8cd23f5d47b02cff56ed0

View File

@ -0,0 +1 @@
abdbd2a518c2fa26ffb922b4279aa32f6a1c7366

View File

@ -0,0 +1 @@
68d972235da27733e5f1e140e1163343b376e673

View File

@ -0,0 +1 @@
2025e8b85291774cae2ef9411a0b019e4535a821

View File

@ -0,0 +1 @@
4f8e399280eddde780d7d57d122949d3708c7e97

View File

@ -0,0 +1,63 @@
# WebRTC Conversational Speech dataset
This dataset includes *conversational speech* and *breathing* recordings,
and the audio files are named
``<langCode>_script<num>_<speakerGender>_sp<speakerId>_<turnId>.wav`` and
``breathing_<speakerGender>_sp<speakerId>_<num>.wav`` respectively.
The conversational speech part consists of *turns*. Each turn belongs to a
*script*, and a script involves two *speakers* named A and B.
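
As an illustration of the naming scheme above, the following is a minimal sketch that splits a file name into its fields. The README gives only the field order, so the regular expressions are deliberately permissive, and the helper name `parse_track_name`, the assumption that `<num>` is numeric, and the example file names are hypothetical rather than taken from the dataset.

```python
import re
from typing import Optional

# Assumption: every field is "anything without an underscore", and <num> is
# made of digits. The README does not specify the exact token formats.
SPEECH_RE = re.compile(
    r"^(?P<lang>[^_]+)_script(?P<script>\d+)_(?P<gender>[^_]+)"
    r"_sp(?P<speaker>[^_]+)_(?P<turn>[^_.]+)\.wav$"
)
BREATHING_RE = re.compile(
    r"^breathing_(?P<gender>[^_]+)_sp(?P<speaker>[^_]+)_(?P<num>\d+)\.wav$"
)


def parse_track_name(filename: str) -> Optional[dict]:
    """Return the fields encoded in a track file name, or None if it does not match."""
    for kind, pattern in (("speech", SPEECH_RE), ("breathing", BREATHING_RE)):
        match = pattern.match(filename)
        if match:
            # All field values are returned as strings.
            return {"kind": kind, **match.groupdict()}
    return None


# Hypothetical file names, used only to show the call.
print(parse_track_name("en_script1_f_sp2_a1.wav"))
print(parse_track_name("breathing_m_sp1_3.wav"))
```

Grouping the parsed tracks by `lang`, `script` and `speaker` is one way to reassemble the turns of a script in order.
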
The scripts are designed to span both short and long turns. They are also
gender neutral, so the A and B turns can be recorded by both male and female
speakers.
All audio files are single-channel, with a sample rate of 48000 Hz and 16-bit
samples.
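
The format stated above can be checked on a downloaded file with Python's standard `wave` module. This is only a sketch: the helper name `matches_dataset_format` is hypothetical, and it assumes the files are plain PCM WAV files that `wave` can open.

```python
import wave

# Expected format, as stated in this README.
EXPECTED_CHANNELS = 1
EXPECTED_SAMPLE_RATE_HZ = 48000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16-bit samples


def matches_dataset_format(path: str) -> bool:
    """Return True if the WAV file is mono, 48000 Hz, with 16-bit samples."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getframerate() == EXPECTED_SAMPLE_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
        )
```

Running this check over every track before feeding it to a tool that assumes a fixed format can catch a mismatched or corrupted download early.
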
## English scripts (EN)
### Script 1
- A1: How is your day going?
- B1: Quite busy. I'm preparing for my presentation on our marketing strategy.
- A2: You must feel stressed out now.
- B2: That's an understatement.
### Script 2
- A1: Which book are you reading?
- B1: Let's see if you can guess it, listen carefully. She tries to open the
doors. They are all locked; she begins to weep. She walks weeping to a high
glass table and sits down on its lower ledge. She sits on a big golden key and
picks it up in surprise. She tries it on all the doors but it does not fit.
- A2: Too hard, yet another hint.
- B2: Ok. “Would you tell me, please, which way I ought to go from here?” “That
depends a good deal on where you want to get to”, said the Cat. “I don't much
care where” she replied.
- A3: Stop, I know the rest! “Then it doesn't matter which way you go”, said the
Cat. “So long as I get somewhere” she added as an explanation. “Oh, you're sure
to do that, if you only walk long enough.”
- B3: So, what's the title?
- A4: Alice in Wonderland!
- B4: Well done
## Swedish scripts (SE)
### Script 1
- A1: Hur är det?
- B1: Rätt upptagen. Jag förbereder mig inför min presentation om
marknadsföringsstrategi.
- A2: Då är du säkert riktigt stressad nu.
- B2: Det skulle man kunna säga.
### Script 2
- A1: Vilken bok läser du?
- B1: Jag ska läsa högt för dig. Försök gissa nu! “Dörren är ändå alldeles för
liten för mig. Och jag kan ju inte flyga upp igenom kaninhålet. Hon får syn på
en flaska på bordet. Drick mig. Bäst att vara försiktig. Om man dricker en
flaska där det står gift så får man nog ångra det förr eller senare. Men det
står ju bara drick mig.”
- A2: Det är för svårt! Får jag försöka igen?
- B2: Ok. “Åt vilket håll ska jag gå?” “Det beror på vart du vill komma”, sa
katten. “Det spelar ingen roll.” svarade hon.
- A3: Sluta, jag kan fortsättningen: “Då spelar det heller ingen roll åt vilket
håll du går.” sa katten.
“…så länge som jag kommer någonstans”, fortsatte hon. “Åh, det kommer du att
göra”, sa katten, “bara du går tillräckligt långt”.
- B3: Så vad heter boken?
- A4: Alice i Underlandet
- B4: Jättebra

View File

@ -0,0 +1 @@
a7ffd2bd7662a7e1441c0a93c093020b717740ed

View File

@ -0,0 +1 @@
19e60178318377082faaf1be523b036271f32601

View File

@ -0,0 +1 @@
f21d76075296382320eb971ffe9902f8873190c2

View File

@ -0,0 +1 @@
f0d128c7af55269040c25f1c0077ffde0bcbd820

View File

@ -0,0 +1 @@
218639c3b0a175a74731a17e2d32a9389e227ad8

View File

@ -0,0 +1 @@
f93b08a3533530a061f9954ed7cc5eb7dc603cd9

View File

@ -0,0 +1 @@
5dcd1f6ba74b5a2e41540d30764e2d2557a82c32

View File

@ -0,0 +1 @@
d0f6d950f73d9543aa5f5bab7a9183c492aeffaf

View File

@ -0,0 +1 @@
c2308d932ed2e5f09e99f8e188d021da93e55e7a

View File

@ -0,0 +1 @@
3c249c8473674db6ae5b7c4d07d917de62e999d2

View File

@ -0,0 +1 @@
d743dcd223f2e36a4e6cf84a4634fb315559a11e

View File

@ -0,0 +1 @@
f2433fb1eaa4fa1cefed6a6b6146e3168dbbcf4b

View File

@ -0,0 +1 @@
6c3c4a629c0f1d00e71f6c1c477cfded0dc95cda

View File

@ -0,0 +1 @@
68923cdffa5787e8a6326231034d11f54f37bc5f

View File

@ -0,0 +1 @@
4e098a8874e59381fe3f7f92c81a7a7c14974b73

View File

@ -0,0 +1 @@
d80bef7fb99d1c46bc9b4acfa7a327a3453a3a47

View File

@ -0,0 +1 @@
04faffcb9a75699de0afd38fc91b79b7a465b36f

View File

@ -0,0 +1 @@
b33ae38ae15b2b58a3c31e0153c40c4b29a4ad80

View File

@ -0,0 +1 @@
1d77200eabdd967fbbf34b9f030d569b77b93a87

View File

@ -0,0 +1 @@
17bcdd785c94ba55d70a0b678ba420e427787bf5

View File

@ -0,0 +1 @@
dea503215f8b5017e09b4a08b1f2d7de569f5880

View File

@ -0,0 +1 @@
fd2318e8078e84cd63d4d80f51e2ee998a17840e

View File

@ -0,0 +1 @@
f594d1165fba62991a5746fd00f243ef791d3ba4

View File

@ -0,0 +1 @@
5a714c640e9672a0907b2a5b1a85af639a89cb6c

View File

@ -0,0 +1 @@
7e815bc751cae8d9569c6213fc7a62a53e2ba6d9

View File

@ -0,0 +1 @@
1eac7a3b6743aaea9c024317641a2323d4ded5db

View File

@ -0,0 +1 @@
b03dd575bc2554a31b40c8180cd11fb155235112

View File

@ -0,0 +1 @@
7aa500f2a890e22eccb28207ab53914da3780c95

View File

@ -0,0 +1 @@
534814989060a44b97e31dba2c4d42f362d1dd01

View File

@ -0,0 +1 @@
c36462d18f5cfd6cfef81e714d7b2308da128663