transformers.fx.symbolic_trace supports inputs_embeds #31574

Merged (3 commits), Jul 8, 2024

Conversation

@fxmarty (Contributor) commented on Jun 24, 2024:

We should prompt users to use torch.export.export instead.

Fixes #31414
Fixes #31200
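
For context, torch.export.export is the PyTorch 2.x graph-capture entry point users would be pointed at; a minimal runnable sketch on a toy module (not a transformers model, and not code from this PR):

import torch

class Toy(torch.nn.Module):
    def forward(self, inputs_embeds: torch.Tensor) -> torch.Tensor:
        # Any forward that consumes embeddings directly, as inputs_embeds does.
        return inputs_embeds.mean(dim=-1)

# Capture the forward pass into an ExportedProgram (PyTorch >= 2.1).
exported = torch.export.export(Toy(), args=(torch.rand(3, 5, 32),))
print(exported.graph)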

@HuggingFaceDocBuilderDev commented:

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@amyeroberts (Collaborator) left a comment:

Thanks for working on this fix!

I think we should be a bit smarter about the input preparation here

tests/test_modeling_common.py (outdated)
Comment on lines 1331 to 1338
if "inputs_embeds" in inspect.signature(model.forward).parameters:
inputs_to_test.append(
{
"inputs_embeds": torch.rand(
3, 5, model.config.hidden_size, dtype=torch.float, device=torch_device
)
}
)
Collaborator:

shouldn't this be handled by _prepare_for_class? Especially as it would avoid this hard-coded 3, 5
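
(For reference, a hedged sketch of the pattern this alludes to; prepare_config_and_inputs_for_common and _prepare_for_class are the existing ModelTesterMixin helpers, so shapes come from the model tester instead of being hard-coded:)

# Sketch of the usual transformers test flow, not the exact PR code:
config, inputs_dict = self.model_tester.prepare_config_and_inputs_for_common()
for model_class in self.all_model_classes:
    # _prepare_for_class adapts the shared inputs_dict to each model class.
    prepared_inputs = self._prepare_for_class(inputs_dict, model_class)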

Contributor Author:

Happy to, but inputs_embeds does not seem to be tested (AFAIK it is generally not included in prepare_config_and_inputs_for_common). Do you suggest editing all the tests/models/**/test_modeling_*.py files to get inputs_embeds?
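
One way to sidestep both the hard-coded shape and per-model edits would be to derive the embeddings from the existing input_ids via the model's own embedding layer (a sketch, not code from this PR; get_input_embeddings is the standard PreTrainedModel accessor):

input_ids = inputs_dict["input_ids"]
with torch.no_grad():
    # Reusing the model's embedding layer keeps shape, dtype and device consistent.
    inputs_embeds = model.get_input_embeddings()(input_ids)
inputs_to_test.append({"inputs_embeds": inputs_embeds})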

@@ -1327,16 +1328,30 @@ def _create_and_check_torch_fx_tracing(self, config, inputs_dict, output_loss=Fa
                    (past_mask, inputs_to_test[1]["attention_mask"]), dim=1
                )

            if "inputs_embeds" in inspect.signature(model.forward).parameters:
Collaborator:

Won't this clash with input_ids in the inputs?

Contributor Author:

No, as we add a new set of inputs to test, independent of the previous one that uses input_ids.

Collaborator:

My understanding of the test set-up is that the inputs to the model will now include both input_ids and inputs_embeds, which shouldn't be the case.

Contributor Author:

No, inputs_to_test is a list of dicts, with each dict being one set of inputs to the model.
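
To make that concrete, a small sketch of the shape of inputs_to_test (illustrative values only): the tracer runs once per dict, so input_ids and inputs_embeds are never passed together:

inputs_to_test = [
    {"input_ids": input_ids, "attention_mask": attention_mask},  # first trace: token ids
    {"inputs_embeds": inputs_embeds},  # second, independent trace: embeddings only
]
for inputs in inputs_to_test:
    # Each dict produces its own input names and its own traced graph.
    traced = symbolic_trace(model, list(inputs.keys()))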


if model.__class__.__name__ in set(MODEL_FOR_SEQUENCE_CLASSIFICATION_MAPPING_NAMES.values()) and (
    not hasattr(model.config, "problem_type") or model.config.problem_type is None
):
    model.config.problem_type = "single_label_classification"

traced_model = symbolic_trace(model, input_names)
model.config.use_cache = "past_key_values" in input_names_to_trace
Collaborator:

What about mamba here? It uses cache_params 😅

Contributor Author:

mamba is, AFAIK, not supported in symbolic_trace.

@ydshieh (Collaborator) commented on Jul 8, 2024:

Hi @fxmarty

Could you take a look at the following (see https://app.circleci.com/pipelines/github/huggingface/transformers/97223/workflows/f7cf9ddb-909b-4544-994a-18aaadbb77ac/jobs/1286547)?

FAILED tests/models/blenderbot_small/test_modeling_blenderbot_small.py::BlenderbotSmallModelTest::test_torch_fx - ValueError: You have to specify either input_ids or inputs_embeds
FAILED tests/models/blenderbot_small/test_modeling_blenderbot_small.py::BlenderbotSmallModelTest::test_torch_fx_output_loss - ValueError: You have to specify either input_ids or inputs_embeds
FAILED tests/models/pegasus/test_modeling_pegasus.py::PegasusModelTest::test_torch_fx - ValueError: You have to specify either input_ids or inputs_embeds
FAILED tests/models/pegasus/test_modeling_pegasus.py::PegasusModelTest::test_torch_fx_output_loss - ValueError: You have to specify either input_ids or inputs_embeds
FAILED tests/models/m2m_100/test_modeling_m2m_100.py::M2M100ModelTest::test_torch_fx - ValueError: You have to specify either input_ids or inputs_embeds
FAILED tests/models/m2m_100/test_modeling_m2m_100.py::M2M100ModelTest::test_torch_fx_output_loss - ValueError: You have to specify either input_ids or inputs_embeds
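
All six failures are encoder-decoder models, where passing only inputs_embeds leaves the decoder with neither input_ids nor embeddings; presumably the fix also supplies decoder_inputs_embeds in that case. A hedged sketch of that idea (not necessarily the exact change that landed):

embeds = torch.rand(3, 5, model.config.hidden_size, dtype=torch.float, device=torch_device)
if model.config.is_encoder_decoder:
    # The decoder needs its own embeddings, otherwise forward raises the ValueError above.
    inputs_to_test.append({"inputs_embeds": embeds, "decoder_inputs_embeds": embeds})
else:
    inputs_to_test.append({"inputs_embeds": embeds})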
