Describe the key components and training process of a Transformer model used for text generation.
Naresh Beniwal