Create a Transformer Encoder Block
Implement a single Transformer encoder block:
- Multi-head self-attention.
- Layer normalization.
- Feedforward network.
Compare the output with nn.TransformerEncoderLayer.
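One way to approach the exercise is sketched below. It is a minimal sketch, not a definitive implementation. It assumes the post-norm layout (residual add, then layer normalization) that nn.TransformerEncoderLayer uses by default, a ReLU feedforward network, batch-first inputs, and no dropout or masking. The names EncoderBlock and copy_weights and the sizes are hypothetical choices for illustration. The reference layer's weights are copied into the hand-written block so the two outputs can be compared directly.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class EncoderBlock(nn.Module):
    """Hand-written post-norm encoder block (hypothetical name).

    Assumes no dropout and no attention mask, to keep the sketch short.
    """

    def __init__(self, d_model, n_heads, d_ff):
        super().__init__()
        assert d_model % n_heads == 0, "d_model must be divisible by n_heads"
        self.n_heads = n_heads
        self.d_head = d_model // n_heads
        self.qkv = nn.Linear(d_model, 3 * d_model)  # packed Q, K, V projection
        self.out = nn.Linear(d_model, d_model)      # attention output projection
        self.ff1 = nn.Linear(d_model, d_ff)
        self.ff2 = nn.Linear(d_ff, d_model)
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def attention(self, x):
        b, t, d = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)

        def split(z):
            # (batch, seq, d_model) -> (batch, heads, seq, d_head)
            return z.view(b, t, self.n_heads, self.d_head).transpose(1, 2)

        q, k, v = split(q), split(k), split(v)
        # Scaled dot-product attention, computed per head.
        scores = q @ k.transpose(-2, -1) / self.d_head ** 0.5
        weights = scores.softmax(dim=-1)
        ctx = (weights @ v).transpose(1, 2).reshape(b, t, d)  # merge heads
        return self.out(ctx)

    def forward(self, x):
        x = self.norm1(x + self.attention(x))               # attention sublayer
        x = self.norm2(x + self.ff2(F.relu(self.ff1(x))))   # feedforward sublayer
        return x


def copy_weights(ref, mine):
    """Copy parameters from an nn.TransformerEncoderLayer into EncoderBlock."""
    with torch.no_grad():
        mine.qkv.weight.copy_(ref.self_attn.in_proj_weight)
        mine.qkv.bias.copy_(ref.self_attn.in_proj_bias)
        mine.out.weight.copy_(ref.self_attn.out_proj.weight)
        mine.out.bias.copy_(ref.self_attn.out_proj.bias)
    mine.ff1.load_state_dict(ref.linear1.state_dict())
    mine.ff2.load_state_dict(ref.linear2.state_dict())
    mine.norm1.load_state_dict(ref.norm1.state_dict())
    mine.norm2.load_state_dict(ref.norm2.state_dict())


torch.manual_seed(0)
d_model, n_heads, d_ff = 32, 4, 64  # illustrative sizes

# dropout=0.0 so the reference layer is deterministic, like the sketch above.
ref = nn.TransformerEncoderLayer(
    d_model, n_heads, dim_feedforward=d_ff, dropout=0.0, batch_first=True
).eval()
mine = EncoderBlock(d_model, n_heads, d_ff).eval()
copy_weights(ref, mine)

x = torch.randn(2, 5, d_model)  # (batch, sequence, features)
with torch.no_grad():
    out_ref = ref(x)
    out_mine = mine(x)

print("max abs difference:", (out_ref - out_mine).abs().max().item())
print("outputs match:", torch.allclose(out_ref, out_mine, atol=1e-5))
```

If the outputs do not match, the usual suspects are the order of the residual add and the layer normalization, the scaling factor in the attention scores, and how the packed Q, K, V projection is split into heads.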