Name	Name	Last commit message	Last commit date
parent directory ..
__pycache__	__pycache__
docs	docs
examples	examples
fairseq	fairseq
pre-process	pre-process
scripts	scripts
tests	tests
LICENSE	LICENSE
README.md	README.md
build_vocab.py	build_vocab.py
eval_lm.py	eval_lm.py
generate.py	generate.py
hubconf.py	hubconf.py
ibox_interactive.py	ibox_interactive.py
interactive.py	interactive.py
preprocess.py	preprocess.py
score.py	score.py
setup.py	setup.py
train.py	train.py

Name

Last commit message

Last commit date

__pycache__

Introduction

This page gives an introduction about the project "Deep NMT with pre-training". Main structure is forked from https://github.com/pytorch/fairseq, and we also incorporate some advantages of MASS(https://github.com/microsoft/MASS), XLM(https://github.com/facebookresearch/XLM).

Features

We support Bert-like pre-training and finetuning in Deep NMT. To stabilize the training of Deep NMT, we utilize the "Depth-init" initialization method proposed by "https://arxiv.org/abs/1908.11365"
We support multi-lingual NMT system with weight-sharing.
We support Mass pre-training for NMT.
Other features provided by Fairseq.

examples

We provide examples for pre-train 12-layer encoders for English, and then fineting the pre-trained model on English-German translation tasks with 12-layer encoders and 12-layer decoders.

Requirements

Pytorch version > 1.0.0
Python version >=3.6
For training new models, you'll also need an NVIDIA GPU and NCCL
Other requirements by Fairseq.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Introduction

Features

examples

Requirements

FilesExpand file tree

codes_src

Directory actions

More options

Directory actions

More options

Latest commit

History

codes_src

Folders and files

parent directory

README.md

Introduction

Features

examples

Requirements