
parameters of Linear should be swapped #414

Closed
vineetk1 opened this issue Dec 17, 2018 · 0 comments
@vineetk1 (Contributor)

File: https://github.com/pytorch/fairseq/blob/master/fairseq/models/lstm.py

It seems that on line 269

self.input_proj = Linear(input_embed_dim, output_embed_dim, bias=False)

the parameters input_embed_dim and output_embed_dim should be swapped, as follows:

self.input_proj = Linear(output_embed_dim, input_embed_dim, bias=False)

If line 269 is not changed, then lines 277 and 280:

x = self.input_proj(input)
attn_scores = (source_hids * x.unsqueeze(0)).sum(dim=2)

should be replaced with the following two lines:

source_hids = self.input_proj(source_hids)
attn_scores = (source_hids * input.unsqueeze(0)).sum(dim=2)
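For readers following the shape argument: a `Linear(in_features, out_features)` layer requires its input's last dimension to equal `in_features`, so the correct orientation depends on whether `input` or `source_hids` is being projected. A minimal sketch of that shape check, using NumPy as a stand-in for `torch.nn.Linear` (the sizes and variable names below are illustrative, not fairseq's actual dimensions), traces lines 277 and 280 as the code stands with line 269 unchanged:

```python
import numpy as np

srclen, bsz = 5, 3
input_embed_dim, output_embed_dim = 8, 4  # illustrative sizes

# decoder state: (bsz, input_embed_dim)
input = np.random.randn(bsz, input_embed_dim)
# encoder outputs: (srclen, bsz, output_embed_dim)
source_hids = np.random.randn(srclen, bsz, output_embed_dim)

# stand-in for Linear(input_embed_dim, output_embed_dim, bias=False):
# a weight matrix mapping the last dim from input_embed_dim to output_embed_dim
W = np.random.randn(input_embed_dim, output_embed_dim)

# x = self.input_proj(input)  ->  (bsz, output_embed_dim)
x = input @ W

# attn_scores = (source_hids * x.unsqueeze(0)).sum(dim=2)
# broadcasting (1, bsz, output_embed_dim) against (srclen, bsz, output_embed_dim)
attn_scores = (source_hids * x[None, :, :]).sum(axis=2)
print(attn_scores.shape)  # (srclen, bsz)
```

With the swapped orientation proposed above, the projection would instead be applied to `source_hids` (last dimension `output_embed_dim`), mapping it down to `input_embed_dim` to match `input`; either orientation produces `(srclen, bsz)` scores, so the question in this issue is which quantity should be projected, not a shape error.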

moussaKam pushed a commit to moussaKam/language-adaptive-pretraining that referenced this issue Sep 29, 2020
Summary: Pull Request resolved: facebookresearch#470

Differential Revision: D13803964

Pulled By: myleott

fbshipit-source-id: 91b66599e9a539833fcedea07c608b349ba3b449