webrtc_m130/gru_in.dat.sha1 at 4e9c5b592aceceb5209fa0610256e27ae2c5e3a1 - webrtc_m130 - Gitea: Git with a cup of tea

admin/webrtc_m130

Alessio Bazzica 4e9c5b592a RNN VAD: GRU layer optimized

Using `VectorMath::DotProduct()` in GatedRecurrentLayer to reuse existing
SIMD optimizations. Results:
- When SSE2/AVX2 is avilable, the GRU layer takes 40% of the unoptimized
  code
- The realtime factor for the VAD improved as follows
  - SSE2: from 570x to 630x
  - AVX2: from 610x to 680x

This CL also improved the GRU layer benchmark by (i) benchmarking a GRU
layer havibng the same size of that used in the VAD and (ii) by prefetching
a long input sequence.

Bug: webrtc:10480
Change-Id: I9716b15661e4c6b81592b4cf7c172d90e41b5223
Reviewed-on: https://webrtc-review.googlesource.com/c/src/+/195545
Reviewed-by: Per Åhgren <peah@webrtc.org>
Commit-Queue: Alessio Bazzica <alessiob@webrtc.org>
Cr-Commit-Position: refs/heads/master@{#32803}

2020-12-08 15:37:38 +00:00

1 line

40 B

Plaintext

Raw Blame History

402abf7a4e5d35abb78906fff2b3f4d8d24aa629