Construct a Named Entity Recognition App with Streamlit | by Nikos Kafritsas | Aug, 2022

September 1, 2022

1

From constructing the app to deployment — with code included

NER App with Streamlit, picture by writer (Supply)

In my earlier article, we fine-tuned a Named Entity Recognition (NER) mannequin, educated on the wnut_17[1] dataset.

On this article, we present step-by-step find out how to combine this mannequin with Streamlit and deploy it utilizing HugginFace Areas. The purpose of this app is to tag enter sentences per person request in actual time.

Additionally, take into accout, that opposite to trivial ML fashions, deploying a big language mannequin on Streamlit is difficult. We additionally deal with these challenges.

Let’s dive in!

Streamlit is an easy-to-use device for creating interactive purposes that sit on high of an information science challenge.

There are related ML-friendly instruments like Sprint and Gradio. Each has its strengths. For instance, Gradio has a tremendous drag-and-drop part, appropriate for picture classification fashions.

Normally, I favor Streamlit as a result of:

It has a spectacular trajectory to date — in the course of the previous yr, Streamlit has been releasing main updates at the least as soon as a month.
It has a robust group. Members at dialogue boards are super-helpful. Additionally, you possibly can add your app without cost on Streamlit Cloud. In case your app is fascinating, the group managers will attain out to you and have your app on the Streamlit web site! They might even ship you presents!

Other than development and robust group, Streamlit is a fully-fledged device, appropriate for interactive purposes in each knowledge science area.

Subsequent, let’s construct our app!

The total instance may be discovered right here.

This text focuses on constructing and deploying our mannequin with Streamlit.

If you wish to study extra about how the mannequin is produced, be happy to examine my earlier submit.

There may be one change although: We use the roberta-large mannequin from HugginFace as a substitute of bert-base. RoBERTa[2] launched just a few novelties like dynamic masking, which make RoBERTa superior to BERT.

Libraries

First, we want the next libraries. For readability, check out the necessities.txt file:

pytorch-lightning==0.9.0
torch==1.10.0
torchtext==0.8.0
torchvision==0.11.1 
datasets==2.3.2
numpy==1.20.3
pandas==1.3.5
streamlit==1.11.1
transformers==4.12.5