← Back to Data Science

All Topics

Advertisement

Learn/Data Science/Data Science Applications

Named Entity Recognition

Topic: NLP

Advertisement

NER Fundamentals

Identify and classify named entities.

Task

Input: sequence of tokens. Output: entity labels per token. BIO tagging: B-entity, I-entity, O.

Common types: PERSON, ORG, LOC, DATE, MONEY.

Approaches

Rule-based: dictionary matching. Statistical: HMM, CRF. Neural: Bi-LSTM-CRF, BERT-based.

CRF models label sequences. Bi-LSTM captures context.

Libraries

SpaCy: pre-trained models. Stanford NER. Hugging Face: BERT-NER.

Key Takeaways

  1. NER labels sequence tokens
  2. CRF models sequential labels
  3. BERT-based models are current state-of-art

Advertisement

Advertisement

Need More Practice?

Get personalized data science help from ChatWhole's AI-powered platform.

Get Expert Help →