Conversational Speech dataset
Publicly available dataset of conversational speech audio recordings.

This CL includes the following:
- README.md: dataset description file, which also includes the scripts
- *.wav.sha1: hash files for each audio track in the dataset

The overall size of the wav files is ~36MB.

The primary intended use of this dataset is in combination with the
conversational speech tool (see
https://chromium.googlesource.com/external/webrtc/+/master/webrtc/modules/audio_processing/test/conversational_speech/),
which can be used to create longer recordings with custom turn switch timing.

BUG=webrtc:7218

Review-Url: https://codereview.webrtc.org/2869833002
Cr-Commit-Position: refs/heads/master@{#18068}
Parent: f9e5ebcac7
Commit: 77b376c094
@@ -0,0 +1 @@
3be3b93e5d2d346dc234c73de9d70f3363477c98
@@ -0,0 +1 @@
17cc4500202ff01f99eaedbf0e48313bc6bc9ed2
@@ -0,0 +1 @@
22301b5ffcd00550e5a1c17d4475269b913e0a40
@@ -0,0 +1 @@
1cdedd87f100a0b152caef29dd8bf80458fa3bfc
@@ -0,0 +1 @@
35c35112547b87ae62470e71eb6ecb6c3ede6d6b
@@ -0,0 +1 @@
d2c4623a35cd9bd341d3f69e883ae552ba82128e
@@ -0,0 +1 @@
b08e20876c28d8f7b804121b0ada68bb979904fd
@@ -0,0 +1 @@
4e62ca314a3c509718c3bf9d4d274d7bfa9b1201
@@ -0,0 +1 @@
886c479f881fcf5ec1f89eb8d5f1d0aac906e1c8
@@ -0,0 +1 @@
b66c18917897b7ec189e18534eeead51656676ba
@@ -0,0 +1 @@
01f401a785ea59d293b9aa91720793ef246cda1d
@@ -0,0 +1 @@
0f33fc0072e8dd7be69a5e9ec64ae21d08d382b2
@@ -0,0 +1 @@
7774368c0285c2e37c3bee74fbc6b6287e48dc27
@@ -0,0 +1 @@
d75a37ce09456d64e8e764dabf4e1c5f62a3021e
@@ -0,0 +1 @@
91b128d52a8c2790b4d7ddcef617373e3e9a0fd3
@@ -0,0 +1 @@
bca4edcb8007d93185052493569c24cc4248f684
@@ -0,0 +1 @@
a6a0265063f9e7b1ba830a35dda2f05399dec3c1
@@ -0,0 +1 @@
d480ad07c11907ef985d2f726b783ed623945ab2
@@ -0,0 +1 @@
0060e174e4563bda2555bed1360e6e592d53385a
@@ -0,0 +1 @@
83e10191061c866177fc4ea846f4f6a96f76e7ea
@@ -0,0 +1 @@
cb14a4f9a3f7e01830d4de85830a379f1da5c7aa
@@ -0,0 +1 @@
eda398bd6c1e8df0c689df089f54e71c9db05c2e
@@ -0,0 +1 @@
18d61c3561eb3b66ce0937599f0e11ea0772a586
@@ -0,0 +1 @@
79c00d4487f8412cd18f735958deae625f7957ac
@@ -0,0 +1 @@
15506843d6a498334bb3dde70c891a069e7b907d
@@ -0,0 +1 @@
0ca6092950c59edd00a0b659c32c5f20f3a17076
@@ -0,0 +1 @@
5abaf53dc62f0e48e4b96cdfb89bc8e2ece358c5
@@ -0,0 +1 @@
03098c059e91ebc17fc4d24c665eeb5f5c61d0ad
@@ -0,0 +1 @@
8e02a66f1056dea5da74836637f929cee3eb7a25
@@ -0,0 +1 @@
d8951ec345c2df067bd205deb185cd264a4d5cb9
@@ -0,0 +1 @@
be92633ad4ffba1421d29ba50aa707620ea9816e
@@ -0,0 +1 @@
d1262e287cf97c41c2d8cd23f5d47b02cff56ed0
@@ -0,0 +1 @@
abdbd2a518c2fa26ffb922b4279aa32f6a1c7366
@@ -0,0 +1 @@
68d972235da27733e5f1e140e1163343b376e673
@@ -0,0 +1 @@
2025e8b85291774cae2ef9411a0b019e4535a821
@@ -0,0 +1 @@
4f8e399280eddde780d7d57d122949d3708c7e97
resources/audio_processing/conversational_speech/README.md (new file, 63 lines added)
@@ -0,0 +1,63 @@
# WebRTC Conversational Speech dataset

This dataset includes *conversational speech* and *breathing* recordings,
and the audio files are named
``<langCode>_script<num>_<speakerGender>_sp<speakerId>_<turnId>.wav`` and
``breathing_<speakerGender>_sp<speakerId>_<num>.wav`` respectively.
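
The two naming patterns above can be split into their fields with a pair of regular expressions. The following is a minimal sketch, not part of the dataset tooling: the function name `parse_track_name`, the `Track` type, and the example file names are hypothetical, and since the README does not state the allowed values for each field, every field is simply matched as "any text without an underscore".

```python
import re
from typing import NamedTuple, Optional

# Permissive patterns: the README fixes only the field order and the
# separators, so each field is matched as "anything without an underscore".
SPEECH_RE = re.compile(
    r"^(?P<lang>[^_]+)_script(?P<script>\d+)_(?P<gender>[^_]+)"
    r"_sp(?P<speaker>[^_]+)_(?P<turn>[^_.]+)\.wav$"
)
BREATHING_RE = re.compile(
    r"^breathing_(?P<gender>[^_]+)_sp(?P<speaker>[^_]+)_(?P<num>\d+)\.wav$"
)


class Track(NamedTuple):
    kind: str     # "speech" or "breathing"
    fields: dict  # field name -> raw string taken from the file name


def parse_track_name(name: str) -> Optional[Track]:
    """Return the parsed fields, or None if the name fits neither pattern."""
    for kind, pattern in (("breathing", BREATHING_RE), ("speech", SPEECH_RE)):
        match = pattern.match(name)
        if match:
            return Track(kind, match.groupdict())
    return None


if __name__ == "__main__":
    # Hypothetical file names, made up to illustrate the patterns.
    print(parse_track_name("en_script1_f_sp1_a1.wav"))
    print(parse_track_name("breathing_m_sp2_3.wav"))
```

Fields are kept as raw strings on purpose, so the sketch makes no claim about how language codes, genders, or turn identifiers are actually spelled in the file names.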

The conversational speech part consists of *turns*. Each turn belongs to a
*script*, and a script involves two *speakers* named A and B.

The scripts have been designed to span both short and long turns. They are also
gender neutral, so the A and B turns can be recorded by both male and female
speakers.

All the audio files have a single channel and a sample rate of 48000 Hz with
16-bit samples.
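
The stated audio format (one channel, 48000 Hz, 16-bit samples) can be checked with Python's standard `wave` module. This is a minimal sketch under the assumption that the tracks are plain PCM WAV files; the helper names `read_wav_format` and `matches_dataset_format` are hypothetical and not part of the dataset.

```python
import wave

# Format stated in the README: mono, 48000 Hz, 16-bit (2 bytes per sample).
EXPECTED = {"channels": 1, "sample_rate_hz": 48000, "sample_width_bytes": 2}


def read_wav_format(path: str) -> dict:
    """Read channel count, sample rate, and sample width from a WAV header."""
    with wave.open(path, "rb") as wav:
        return {
            "channels": wav.getnchannels(),
            "sample_rate_hz": wav.getframerate(),
            "sample_width_bytes": wav.getsampwidth(),
        }


def matches_dataset_format(path: str) -> bool:
    """True if the file has the format the README describes."""
    return read_wav_format(path) == EXPECTED
```

Calling `matches_dataset_format` on each downloaded track is a cheap sanity check before feeding the files to other tools.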

## English scripts (EN)
### Script 1
- A1: How is your day going?
- B1: Quite busy. I'm preparing for my presentation on our marketing strategy.
- A2: You must feel stressed out now.
- B2: That's an understatement.
### Script 2
- A1: Which book are you reading?
- B1: Let’s see if you can guess it, listen carefully. She tries to open the
  doors. They are all locked; she begins to weep. She walks weeping to a high
  glass table and sits down on its lower ledge. She sits on a big golden key and
  picks it up in surprise. She tries it on all the doors but it does not fit.
- A2: Too hard, yet another hint.
- B2: Ok. “Would you tell me, please, which way I ought to go from here?” “That
  depends a good deal on where you want to get to”, said the Cat. “I don't much
  care where” she replied.
- A3: Stop, I know the rest! “Then it doesn't matter which way you go”, said the
  Cat. “So long as I get somewhere” she added as an explanation. “Oh, you're sure
  to do that, if you only walk long enough.”
- B3: So, what’s the title?
- A4: Alice in Wonderland!
- B4: Well done
## Swedish scripts (SE)
### Script 1
- A1: Hur är det?
- B1: Rätt upptagen. Jag förbereder mig inför min presentation om
  marknadsföringsstrategi.
- A2: Då är du säkert riktigt stressad nu.
- B2: Det skulle man kunna säga.
### Script 2
- A1: Vilken bok läser du?
- B1: Jag ska läsa högt för dig. Försök gissa nu! “Dörren är ändå alldeles för
  liten för mig. Och jag kan ju inte flyga upp igenom kaninhålet. Hon får syn på
  en flaska på bordet. Drick mig. Bäst att vara försiktig. Om man dricker en
  flaska där det står gift så får man nog ångra det förr eller senare. Men det
  står ju bara drick mig.”
- A2: Det är för svårt! Får jag försöka igen?
- B2: Ok. “Åt vilket håll ska jag gå?” “Det beror på vart du vill komma”, sa
  katten. “Det spelar ingen roll.” svarade hon.
- A3: Sluta, jag kan fortsättningen: “Då spelar det heller ingen roll åt vilket
  håll du går.” sa katten.
  “…så länge som jag kommer någonstans”, fortsatte hon. “Åh, det kommer du att
  göra”, sa katten, “bara du går tillräckligt långt”.
- B3: Så vad heter boken?
- A4: Alice i Underlandet
- B4: Jättebra
@@ -0,0 +1 @@
a7ffd2bd7662a7e1441c0a93c093020b717740ed
@@ -0,0 +1 @@
19e60178318377082faaf1be523b036271f32601
@@ -0,0 +1 @@
f21d76075296382320eb971ffe9902f8873190c2
@@ -0,0 +1 @@
f0d128c7af55269040c25f1c0077ffde0bcbd820
@@ -0,0 +1 @@
218639c3b0a175a74731a17e2d32a9389e227ad8
@@ -0,0 +1 @@
f93b08a3533530a061f9954ed7cc5eb7dc603cd9
@@ -0,0 +1 @@
5dcd1f6ba74b5a2e41540d30764e2d2557a82c32
@@ -0,0 +1 @@
d0f6d950f73d9543aa5f5bab7a9183c492aeffaf
@@ -0,0 +1 @@
c2308d932ed2e5f09e99f8e188d021da93e55e7a
@@ -0,0 +1 @@
3c249c8473674db6ae5b7c4d07d917de62e999d2
@@ -0,0 +1 @@
d743dcd223f2e36a4e6cf84a4634fb315559a11e
@@ -0,0 +1 @@
f2433fb1eaa4fa1cefed6a6b6146e3168dbbcf4b
@@ -0,0 +1 @@
6c3c4a629c0f1d00e71f6c1c477cfded0dc95cda
@@ -0,0 +1 @@
68923cdffa5787e8a6326231034d11f54f37bc5f
@@ -0,0 +1 @@
4e098a8874e59381fe3f7f92c81a7a7c14974b73
@@ -0,0 +1 @@
d80bef7fb99d1c46bc9b4acfa7a327a3453a3a47
@@ -0,0 +1 @@
04faffcb9a75699de0afd38fc91b79b7a465b36f
@@ -0,0 +1 @@
b33ae38ae15b2b58a3c31e0153c40c4b29a4ad80
@@ -0,0 +1 @@
1d77200eabdd967fbbf34b9f030d569b77b93a87
@@ -0,0 +1 @@
17bcdd785c94ba55d70a0b678ba420e427787bf5
@@ -0,0 +1 @@
dea503215f8b5017e09b4a08b1f2d7de569f5880
@@ -0,0 +1 @@
fd2318e8078e84cd63d4d80f51e2ee998a17840e
@@ -0,0 +1 @@
f594d1165fba62991a5746fd00f243ef791d3ba4
@@ -0,0 +1 @@
5a714c640e9672a0907b2a5b1a85af639a89cb6c
@@ -0,0 +1 @@
7e815bc751cae8d9569c6213fc7a62a53e2ba6d9
@@ -0,0 +1 @@
1eac7a3b6743aaea9c024317641a2323d4ded5db
@@ -0,0 +1 @@
b03dd575bc2554a31b40c8180cd11fb155235112
@@ -0,0 +1 @@
7aa500f2a890e22eccb28207ab53914da3780c95
@@ -0,0 +1 @@
534814989060a44b97e31dba2c4d42f362d1dd01
@@ -0,0 +1 @@
c36462d18f5cfd6cfef81e714d7b2308da128663
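
Each one-line hunk in this diff adds a `*.wav.sha1` file containing a single 40-character hex string. A downloaded track could be checked against its hash file along the following lines. This is a minimal sketch that assumes each `.wav.sha1` file holds the SHA-1 hex digest of the WAV file with the same name and sits next to it; the commit message only says these are "hash files for each audio track", so both the layout and the helper names `sha1_of_file` and `verify_track` are assumptions.

```python
import hashlib
from pathlib import Path


def sha1_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA-1 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha1()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_track(wav_path: Path) -> bool:
    """Compare a WAV file against the digest stored in '<name>.wav.sha1'.

    Assumption: the hash file sits next to the WAV file and contains only
    the hex digest, possibly followed by a newline.
    """
    sha1_path = wav_path.with_name(wav_path.name + ".sha1")
    expected = sha1_path.read_text().strip().lower()
    return sha1_of_file(wav_path) == expected
```

Reading in chunks keeps memory use flat regardless of track length, although at roughly 36 MB in total the whole dataset would also fit in memory.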