Skip to content
This repository was archived by the owner on Nov 16, 2023. It is now read-only.
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
37 commits
Select commit Hold shift + click to select a range
11551c5
add minilm components
daden-ms Apr 10, 2020
da0fb13
Merge branch 'staging' of https://github.com/microsoft/nlp-recipes in…
daden-ms Apr 10, 2020
f43070f
add minilm into tests
daden-ms Apr 11, 2020
bb1b5ce
add top_n when creating S2SAbsSumDataset; minor change according to t…
daden-ms Apr 14, 2020
bf218ec
handle uncased model; add bertconfig for minilm
daden-ms Apr 14, 2020
8dbb0f9
add printout to inform user about using the cached file
daden-ms Apr 16, 2020
73961db
add minilm notebook
daden-ms Apr 16, 2020
3508bb2
add integration test
daden-ms Apr 16, 2020
b04b383
update documentation
daden-ms Apr 16, 2020
910ff02
increase the duration of the test before canceling due to timeout
daden-ms Apr 16, 2020
ac90fd7
modify the notebook according to review comments
daden-ms Apr 17, 2020
b2183d8
fix notebook
daden-ms Apr 17, 2020
4a46755
add timer for fitting time
daden-ms Apr 21, 2020
886e44b
cleanup and add documentation
daden-ms Apr 21, 2020
75e70a2
Merge pull request #584 from microsoft/daden/minilm
daden-ms Apr 21, 2020
eab92fa
update NER example
saidbleik May 5, 2020
d331f32
minor edit
saidbleik May 5, 2020
71921da
update NER example
saidbleik May 8, 2020
2c3b38b
update ner utils
saidbleik May 8, 2020
e418fc9
update seq classification utils
saidbleik May 8, 2020
c1aaf25
common updates
saidbleik May 8, 2020
8f965d4
qa updates
saidbleik May 8, 2020
78726ff
update summarization models
saidbleik May 8, 2020
9d9ef20
resolve conflicts
saidbleik May 11, 2020
477db37
Merge branch 'staging' into bleik/add-models
saidbleik May 11, 2020
ad32375
fix ner test
saidbleik May 12, 2020
2e92e31
Merge branch 'bleik/add-models' of https://github.com/microsoft/nlp-r…
saidbleik May 12, 2020
cd50ef8
Update gpu_unit_tests_linux.yml for Azure Pipelines
saidbleik May 13, 2020
b34a715
Merge branch 'staging' into azure-pipelines
saidbleik May 13, 2020
30afaa4
Merge pull request #587 from microsoft/azure-pipelines
saidbleik May 13, 2020
e02e3b5
Merge pull request #586 from microsoft/bleik/add-models
saidbleik May 13, 2020
9d47434
Fixed capitalization in README.md
mikaelsouza May 13, 2020
a363eca
Added references table header
mikaelsouza May 13, 2020
aa7340c
Merge pull request #588 from mikaelsouza/fixing-capitalization-and-ta…
saidbleik May 13, 2020
af4df5c
add load model fn
saidbleik May 20, 2020
db9f076
minor edits
saidbleik May 20, 2020
ab25969
Merge pull request #590 from microsoft/bleik/add-models
saidbleik May 20, 2020
File filter

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -50,7 +50,7 @@ The following is a summary of the commonly used NLP scenarios covered in the rep
|-------------------------| ------------------- |-------|---|
|Text Classification |BERT, XLNet, RoBERTa| Text classification is a supervised learning method of learning and predicting the category or the class of a document given its text content. |English, Hindi, Arabic|
|Named Entity Recognition |BERT| Named entity recognition (NER) is the task of classifying words or key phrases of a text into predefined entities of interest. |English|
|Text Summarization|BERTSumExt <br> BERTSumAbs <br> UniLM (s2s-ft)|Text summarization is a language generation task of summarizing the input text into a shorter paragraph of text.|English
|Text Summarization|BERTSumExt <br> BERTSumAbs <br> UniLM (s2s-ft) <br> MiniLM |Text summarization is a language generation task of summarizing the input text into a shorter paragraph of text.|English
|Entailment |BERT, XLNet, RoBERTa| Textual entailment is the task of classifying the binary relation between two natural-language texts, *text* and *hypothesis*, to determine if the *text* agrees with the *hypothesis* or not. |English|
|Question Answering |BiDAF, BERT, XLNet| Question answering (QA) is the task of retrieving or generating a valid answer for a given query in natural language, provided with a passage related to the query. |English|
|Sentence Similarity |BERT, GenSen| Sentence similarity is the process of computing a similarity score given a pair of text documents. |English|
Expand Down Expand Up @@ -89,9 +89,9 @@ We hope that the open source community would contribute to the content and bring
## References
The following is a list of related repositories that we like and think are useful for NLP tasks.

|||
|Repository|Description|
|---|---|
|[transformers](https://github.com/huggingface/transformers)|A great PyTorch library from Hugging Face with implementations of popular transformer-based models. We've been using their package extensively in this repo and greatly appreciate their effort.|
|[Transformers](https://github.com/huggingface/transformers)|A great PyTorch library from Hugging Face with implementations of popular transformer-based models. We've been using their package extensively in this repo and greatly appreciate their effort.|
|[Azure Machine Learning Notebooks](https://github.com/Azure/MachineLearningNotebooks/)|ML and deep learning examples with Azure Machine Learning.|
|[AzureML-BERT](https://github.com/Microsoft/AzureML-BERT)|End-to-end recipes for pre-training and fine-tuning BERT using Azure Machine Learning service.|
|[MASS](https://github.com/microsoft/MASS)|MASS: Masked Sequence to Sequence Pre-training for Language Generation.|
Expand Down
Loading