[Relax][PyTorch] Add support for bidirectional LSTM #18516
mshr-h merged 3 commits into apache:main from
Conversation
cc @tlopex

/gemini review
Code Review
This pull request adds support for bidirectional LSTMs in the PyTorch frontend, which is a great enhancement. The implementation involves refactoring the LSTM logic into a new _lstm_cell_unroll helper function and updating the _lstm operator to handle both forward and backward passes. The accompanying tests have also been significantly improved by adding a numerical verification helper and expanding test coverage to include various LSTM configurations. I've identified a critical regression where the fallback for missing LSTM parameters was removed, which could lead to crashes. I've also suggested adding a warning for when a default hidden_size is used, to prevent silent errors. Overall, this is a solid contribution with good testing practices.
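For context on what the refactored `_lstm_cell_unroll` helper computes, the bidirectional unrolling can be sketched in plain NumPy (the actual converter emits Relax ops; shapes and names here are illustrative, following PyTorch's `(4*H, input)` / `(4*H, H)` gate-packed weight layout): run the cell forward over time, run it again over the reversed sequence, then concatenate the two hidden-state streams along the feature axis.

```python
import numpy as np

def lstm_cell(x, h, c, w_ih, w_hh, b):
    # Gates packed as [i, f, g, o] in PyTorch order.
    gates = x @ w_ih.T + h @ w_hh.T + b
    i, f, g, o = np.split(gates, 4, axis=-1)
    sig = lambda z: 1.0 / (1.0 + np.exp(-z))
    c_new = sig(f) * c + sig(i) * np.tanh(g)
    h_new = sig(o) * np.tanh(c_new)
    return h_new, c_new

def lstm_unroll(xs, w_ih, w_hh, b, hidden_size):
    # Unroll one direction over the time axis, collecting hidden states.
    h = np.zeros(hidden_size)
    c = np.zeros(hidden_size)
    outs = []
    for x in xs:
        h, c = lstm_cell(x, h, c, w_ih, w_hh, b)
        outs.append(h)
    return np.stack(outs)

rng = np.random.default_rng(0)
T, I, H = 5, 3, 4
xs = rng.standard_normal((T, I))
params_fwd = (rng.standard_normal((4 * H, I)),
              rng.standard_normal((4 * H, H)),
              rng.standard_normal(4 * H))
params_bwd = (rng.standard_normal((4 * H, I)),
              rng.standard_normal((4 * H, H)),
              rng.standard_normal(4 * H))

out_fwd = lstm_unroll(xs, *params_fwd, H)
# Backward pass: reverse the input in time, then reverse the output back.
out_bwd = lstm_unroll(xs[::-1], *params_bwd, H)[::-1]
out = np.concatenate([out_fwd, out_bwd], axis=-1)  # shape (T, 2*H)
```

This mirrors why a bidirectional LSTM's output feature dimension is `2 * hidden_size`.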
```diff
  weight_ih_fwd = params[0] if params else None
  weight_hh_fwd = params[1] if params and len(params) > 1 else None
  bias_ih_fwd = params[2] if params and has_biases and len(params) > 2 else None
  bias_hh_fwd = params[3] if params and has_biases and len(params) > 3 else None

  if bidirectional and params and len(params) >= params_per_direction * 2:
      weight_ih_bwd = params[params_per_direction]
      weight_hh_bwd = params[params_per_direction + 1]
      bias_ih_bwd = params[params_per_direction + 2] if has_biases else None
      bias_hh_bwd = params[params_per_direction + 3] if has_biases else None
  else:
-     # Fallback: create zero weights
-     weight_ih = self.block_builder.emit(
-         relax.op.zeros(relax.ShapeExpr((4 * hidden_size, input_size)), dtype)
-     )
-     weight_hh = self.block_builder.emit(
-         relax.op.zeros(relax.ShapeExpr((4 * hidden_size, hidden_size)), dtype)
-     )
-     bias_ih = None
-     bias_hh = None
-     # Initialize hidden and cell states
+     weight_ih_bwd = None
+     weight_hh_bwd = None
+     bias_ih_bwd = None
+     bias_hh_bwd = None
```
This change removes the fallback logic for creating zero-initialized weights when LSTM parameters are not provided. The new implementation assigns None to weight variables, which will cause a crash inside _lstm_cell_unroll when relax.op.permute_dims is called on a None value. This appears to be a regression from the previous behavior.
Please consider restoring the fallback logic to create zero weights for both forward and backward directions if they are not available in params.
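A minimal sketch of the restored fallback might look like the following. The real code lives inside the converter and emits `relax.op.zeros` through `self.block_builder`, so this standalone version substitutes plain NumPy zeros; the helper name `get_lstm_params` and its exact signature are hypothetical, introduced only for illustration.

```python
import numpy as np

def get_lstm_params(params, has_biases, params_per_direction,
                    hidden_size, input_size, bidirectional, dtype="float32"):
    """Extract per-direction LSTM weights, falling back to zeros when absent."""
    def zeros_fallback():
        # Zero weights keep downstream matmuls well-formed instead of
        # crashing on a None weight.
        w_ih = np.zeros((4 * hidden_size, input_size), dtype)
        w_hh = np.zeros((4 * hidden_size, hidden_size), dtype)
        return w_ih, w_hh, None, None

    if params and len(params) >= params_per_direction:
        fwd = (params[0], params[1],
               params[2] if has_biases else None,
               params[3] if has_biases else None)
    else:
        fwd = zeros_fallback()

    if not bidirectional:
        return fwd, None

    if params and len(params) >= params_per_direction * 2:
        off = params_per_direction
        bwd = (params[off], params[off + 1],
               params[off + 2] if has_biases else None,
               params[off + 3] if has_biases else None)
    else:
        # Same fallback for the backward direction, per the review comment.
        bwd = zeros_fallback()
    return fwd, bwd
```

The point is simply that both branches of the `if bidirectional` check, and the forward path as well, must produce usable tensors rather than `None`.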
```python
else:
    # Fallback to a default hidden size
    hidden_size = 16
```
The code falls back to a default hidden_size of 16 when it cannot be inferred from the model parameters. This could lead to unexpected behavior or errors if the actual model has a different hidden size. It would be beneficial to add a warning to notify the user about this fallback, so they are aware of the potential discrepancy.
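The warning the reviewer suggests could be sketched as below. The function name `infer_hidden_size` and the constant are hypothetical stand-ins for the converter's actual inference path; the idea is just to attach a `warnings.warn` to the fallback branch so the discrepancy is visible to the user.

```python
import warnings

DEFAULT_HIDDEN_SIZE = 16  # mirrors the hard-coded fallback in the diff

def infer_hidden_size(weight_hh=None):
    """Infer hidden_size from weight_hh (shape (4*H, H)); warn on fallback."""
    if weight_hh is not None:
        return weight_hh.shape[1]
    warnings.warn(
        "Could not infer hidden_size from LSTM parameters; "
        f"falling back to {DEFAULT_HIDDEN_SIZE}. Results may be incorrect "
        "if the model uses a different hidden size.",
        UserWarning,
    )
    return DEFAULT_HIDDEN_SIZE
```

With this in place the silent mismatch becomes an explicit, filterable `UserWarning`.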
Thanks for your suggestions; I've applied the review feedback and updated the PR.
Thanks!