Updated Automatic Speech Recognition using CTC example for Keras v3 #1768

Open

wants to merge 4 commits into base: master

Conversation

lpizzinidev (Contributor)

Updates the "Automatic Speech Recognition using CTC" example to support Keras v3.

@fchollet (Member) left a comment

Thanks for the PR!

@@ -244,16 +249,74 @@ def encode_single_sample(wav_file, label):
"""


# Reference: https://github.com/keras-team/keras/blob/ec67b760ba25e1ccc392d288f7d8c6e9e153eea2/keras/legacy/backend.py#L674-L711
def ctc_label_dense_to_sparse(labels, label_lengths):
fchollet (Member)

Rather than rewriting this code, you can just use the built-in Keras 3 loss function keras.losses.CTC. I expect it will also enable the code example to run with all backends.
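For reference, a minimal sketch of what the built-in loss accepts (the shapes and vocabulary size below are made up for illustration, not the example's actual values): zero-padded integer transcriptions and per-timestep class probabilities go straight into keras.losses.CTC, with no dense-to-sparse conversion step.

```python
import numpy as np
import keras

# Illustrative shapes only: batch of 2, 50 spectrogram frames,
# 32 output classes (characters plus the CTC blank at index 0).
y_pred = np.random.uniform(size=(2, 50, 32)).astype("float32")
# Integer-encoded labels; 0 is reserved for padding/blank.
y_true = np.random.randint(1, 31, size=(2, 10)).astype("int32")

loss_fn = keras.losses.CTC()
print(float(loss_fn(y_true, y_pred)))
```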

lpizzinidev (Contributor, Author)

Thanks for the feedback 👍
After removing the legacy code, we still have some references to tf in the example, and I'm not sure this can be made backend-agnostic.
Please let me know if I should substitute the remaining tf references.

@fchollet (Member) left a comment

LGTM, thank you! You can add the generated files.

@@ -320,7 +307,7 @@ def build_model(input_dim, output_dim, rnn_layers=5, rnn_units=128):
     # Optimizer
     opt = keras.optimizers.Adam(learning_rate=1e-4)
     # Compile the model and return
-    model.compile(optimizer=opt, loss=CTCLoss)
+    model.compile(optimizer=opt, loss=keras.losses.ctc)
fchollet (Member)

Prefer using CTC() (ends up running the same thing but it's more idiomatic)
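A sketch of the suggested form, using a toy stand-in for the example's build_model (the input width and output size below are placeholders, not the example's real dimensions):

```python
import keras

# Placeholder model: Dense over each time step, softmax over the character set.
inputs = keras.Input(shape=(None, 193), name="spectrogram")
outputs = keras.layers.Dense(32, activation="softmax")(inputs)
model = keras.Model(inputs, outputs)

opt = keras.optimizers.Adam(learning_rate=1e-4)
# Loss instance rather than the bare keras.losses.ctc function.
model.compile(optimizer=opt, loss=keras.losses.CTC())
```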

    input_length = tf.cast(input_length, tf.int32)

    if greedy:
        (decoded, log_prob) = tf.nn.ctc_greedy_decoder(
fchollet (Member)

So, we're going to have to use TF for this and ctc_beam_search_decoder I guess, unless we implement them as new backend ops.
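For context, a rough sketch of the TF-specific decoding path the comment refers to (shapes are illustrative, not the example's real dimensions):

```python
import numpy as np
import tensorflow as tf

# Illustrative shapes: batch of 2, 50 time steps, 32 classes.
y_pred = np.random.uniform(size=(2, 50, 32)).astype("float32")
batch_size, time_steps = y_pred.shape[0], y_pred.shape[1]

# tf.nn.ctc_greedy_decoder expects time-major log-probabilities: (time, batch, classes).
inputs = tf.math.log(tf.transpose(tf.convert_to_tensor(y_pred), perm=[1, 0, 2]) + 1e-7)
sequence_length = tf.constant([time_steps] * batch_size, dtype=tf.int32)

(decoded, log_prob) = tf.nn.ctc_greedy_decoder(inputs, sequence_length)
# decoded[0] is a SparseTensor of label indices; densify for readability.
print(tf.sparse.to_dense(decoded[0]).numpy())
```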

lpizzinidev (Contributor, Author)

Again, thanks for the feedback 👍
I created an issue to address this.
Please let me know if I should change the description or add/remove details.
Thanks!
