Commit 47d428d

Adding a code2vec trained model that was trained on Java-large
1 parent c79da6f commit 47d428d

1 file changed: README.md (14 additions, 0 deletions)
@@ -105,6 +105,9 @@ tar -xvzf java14m_model_trainable.tar

 This model weighs more than twice as much as the stripped version, and it is recommended only if you wish to continue training a model which is already trained. To continue training this trained model, use the `--load` flag to load the trained model, the `--data` flag to point to the new dataset to train on, and the `--save` flag to provide a new save path.

+#### A model that was trained on the Java-large dataset
+We provide an additional code2vec model that was trained on the "Java-large" dataset, which was introduced in the code2seq paper. See [Java-large](#java-large-compressed-72gb-extracted-37gb).
+
 #### Training a model from scratch
 To train a model from scratch:
 * Edit the file [train.sh](train.sh) to point it to the right preprocessed data. By default,
@@ -318,6 +321,17 @@ A dataset of the 9500 top-starred Java projects from GitHub that were created
 since January 2007. It contains 9000 projects for training, 200 for validation and 300 for
 testing. Overall, it contains about 16M examples.

+Additionally, we provide a code2vec model that was trained on the Java-large dataset. This model was not part of the original code2vec paper, but was later used as a baseline in the code2seq paper, which introduced this dataset.
+Trainable model (3.5 GB):
+```
+wget https://code2vec.s3.amazonaws.com/model/java-large-model.tar.gz
+```
+
+"Released model" (1.4 GB, cannot be further trained):
+```
+wget https://code2vec.s3.amazonaws.com/model/java-large-released-model.tar.gz
+```
+
 ## Citation

 [code2vec: Learning Distributed Representations of Code](https://urialon.cswp.cs.technion.ac.il/wp-content/uploads/sites/83/2018/12/code2vec-popl19.pdf)
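
The download commands added in this commit fetch the archives but do not unpack them. The following is a minimal sketch of a helper that downloads one of the two Java-large archives and extracts it with the same `tar -xvzf` invocation the README uses elsewhere. The two URLs come from the diff. The function names `archive_name` and `fetch_model`, the `BASE_URL` variable, and the default `models` destination directory are assumptions made for illustration, and the layout of the extracted files is not stated in the commit.

```shell
#!/bin/sh
set -eu

# Base URL taken from the wget commands in the diff.
BASE_URL="https://code2vec.s3.amazonaws.com/model"

# Map a variant name (hypothetical labels) to the archive file name from the diff.
archive_name() {
  case "$1" in
    trainable) echo "java-large-model.tar.gz" ;;           # 3.5 GB, can be trained further
    released)  echo "java-large-released-model.tar.gz" ;;  # 1.4 GB, cannot be trained further
    *) echo "unknown variant: $1" >&2; return 1 ;;
  esac
}

# Download the chosen archive into a destination directory and extract it there.
# The destination defaults to "models", which is an assumption, not a repository convention.
fetch_model() {
  variant="$1"
  dest="${2:-models}"
  archive="$(archive_name "$variant")"
  mkdir -p "$dest"
  wget -c -P "$dest" "$BASE_URL/$archive"   # -c resumes an interrupted download
  tar -xvzf "$dest/$archive" -C "$dest"     # same tar flags as the README
}
```

After sourcing the file, `fetch_model released` would fetch and unpack the smaller archive. Choose `trainable` only if you intend to continue training, since it is more than twice the size.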
