Discussion 🗣
Hello all!
Continuing a discussion with @EasonC13 and @niallroche, I am opening this thread to keep track of the different approaches to fine-tune a transformer model (BERT or a variation of it like ALBERT) and how to use it with Snorkel.
Context
@EasonC13 already did some work to generate a dataset with multiple NER labels using Keras here: https://github.com/accordproject/labs-cicero-classify/blob/dev/Practice/keras/keras_decompose_NER_model.ipynb
To replicate the above, we can:
- Use PyTorch by following Hugging Face's example (a minimal sketch of this route follows the list)
- Use spaCy v3, which comes with a friendly API for working with transformer models and makes preprocessing the data easy
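For reference, here is a minimal sketch of the PyTorch/Hugging Face route: model and tokenizer setup plus one forward pass. The ALBERT checkpoint and the label set are assumptions for illustration; the label alignment and training loop are covered in the Hugging Face example.

```python
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

# Hypothetical NER label set; the real one comes from our annotated dataset.
labels = ["O", "B-PARTY", "I-PARTY"]

tokenizer = AutoTokenizer.from_pretrained("albert-base-v2")
model = AutoModelForTokenClassification.from_pretrained(
    "albert-base-v2",
    num_labels=len(labels),
    id2label=dict(enumerate(labels)),
)

# One forward pass: a label id is predicted for every sub-word token.
inputs = tokenizer("Acme Corp shall deliver the goods.", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
predicted_ids = logits.argmax(dim=-1)
```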
Detailed Description
If we go with spaCy, Snorkel supports it out of the box. However, that support is limited to spaCy v2, so depending on our needs we could open a pull request to bring spaCy v3 support to Snorkel, although we can also get by without doing so.
Either way, we can wrap our fine-tuned transformer model in a custom labelling function and use Snorkel's built-in spaCy preprocessor for the preprocessing we need.
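As a sketch of that idea, assuming a hypothetical fine-tuned checkpoint named `our-fine-tuned-albert` and a made-up label scheme, the labelling function could look like this:

```python
from snorkel.labeling import labeling_function
from snorkel.preprocess.nlp import SpacyPreprocessor
from transformers import pipeline

ABSTAIN, PARTY = -1, 0  # hypothetical Snorkel label ids

# "our-fine-tuned-albert" is a placeholder for our fine-tuned checkpoint.
ner = pipeline(
    "token-classification",
    model="our-fine-tuned-albert",
    aggregation_strategy="simple",
)

# Snorkel's built-in spaCy preprocessor attaches a parsed `doc` field
# to each data point before the labelling function runs.
spacy_pre = SpacyPreprocessor(text_field="text", doc_field="doc", memoize=True)

@labeling_function(pre=[spacy_pre])
def lf_transformer_party(x):
    # Vote PARTY if the transformer finds any PARTY entity, otherwise abstain.
    entities = ner(x.doc.text)
    return PARTY if any(e["entity_group"] == "PARTY" for e in entities) else ABSTAIN
```

Applying it would then be the standard Snorkel flow (e.g. `PandasLFApplier` over a DataFrame with a `text` column), with the spaCy `doc` available for any token-level logic we want to combine with the transformer's predictions.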
Another thing to consider, given the high run-time cost of a transformer model in production, is how to run inference when the fine-tuned transformer model is used as a labelling function:
- Do batch inference of the labels (sketched after this list)
- Use a lightweight variation of BERT like ALBERT; Hugging Face has an example of how to implement that too
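For the batch-inference option, one minimal sketch (again assuming the hypothetical `our-fine-tuned-albert` checkpoint) is to pre-compute predictions for the whole corpus once, so the labelling functions only perform cheap lookups:

```python
from transformers import pipeline

# Again assuming the hypothetical "our-fine-tuned-albert" checkpoint.
ner = pipeline(
    "token-classification",
    model="our-fine-tuned-albert",
    aggregation_strategy="simple",
)

texts = [
    "Acme Corp shall deliver the goods on 2021-12-01.",
    "The penalty for late delivery is 10.5% of the contract value.",
]

# Passing a list lets the pipeline batch the forward passes; the labelling
# functions can then read from this cache instead of calling the model.
predictions = ner(texts, batch_size=16)
cache = dict(zip(texts, predictions))
```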