TensorLearn
NLP Specialist: BERT & Beyond
Module 9 of 11

9. Topic Modeling (BERTopic)

1. Beyond LDA

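Old School: Latent Dirichlet Allocation (LDA) treats each document as a bag of words, so word order and context are ignored and two documents look related only if they share vocabulary. New School: BERTopic starts from dense document embeddings, so documents with similar meaning can land in the same topic even when they use different words.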
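
To make the bag-of-words limitation concrete, here is a minimal sketch using only the Python standard library. The helper names (`bag_of_words`, `shared_terms`) and the example sentences are illustrative assumptions, not part of LDA or BERTopic.

```python
from collections import Counter


def bag_of_words(text):
    """Count lowercase whitespace-separated tokens; word order is discarded."""
    return Counter(text.lower().split())


def shared_terms(a, b):
    """Return the set of tokens that appear in both texts."""
    return set(bag_of_words(a)) & set(bag_of_words(b))


# Hypothetical example sentences (assumptions for illustration only).
doc_a = "the car would not start"
doc_b = "my automobile failed to turn on"  # same meaning, different words
doc_c = "start the car"                    # different meaning, shared words

overlap_ab = shared_terms(doc_a, doc_b)
overlap_ac = shared_terms(doc_a, doc_c)

# Word order is invisible to a bag of words.
same_bag = bag_of_words("dog bites man") == bag_of_words("man bites dog")
```

Here `doc_a` and `doc_b` describe the same situation but share no tokens, while `doc_a` and `doc_c` overlap on three. An embedding model is meant to close exactly this gap.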

2. The Pipeline

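  1. Embed documents: turn each document into a dense vector with a sentence-embedding model (SBERT).
  2. Reduce Dimensions: project the embeddings into a lower-dimensional space (UMAP) so that clustering is more tractable.
  3. Cluster: group the reduced vectors by density (HDBSCAN). Each cluster becomes a topic, and points that fit no cluster are marked as outliers.
  4. Extract Keywords: score the words in each cluster with class-based TF-IDF (c-TF-IDF) to get a readable topic description.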
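
The last step can be sketched without any external libraries. The snippet below is a simplified illustration of c-TF-IDF, not the BERTopic library's implementation: it assumes steps 1 to 3 have already produced a cluster label per document (the labels here are hand-assigned), uses naive whitespace tokenization, and scores each term as its within-cluster frequency times log(1 + A / f_t), where A is the average number of words per cluster and f_t is the term's frequency across all clusters. The function name `c_tf_idf` and the sample documents are hypothetical.

```python
import math
from collections import Counter


def c_tf_idf(docs, labels, top_n=3):
    """Return the top_n keywords per cluster using a simplified c-TF-IDF."""
    # Treat each cluster as one big document: pool token counts per label.
    class_counts = {}
    for doc, label in zip(docs, labels):
        class_counts.setdefault(label, Counter()).update(doc.lower().split())

    # Frequency of each term across all clusters.
    total_counts = Counter()
    for counts in class_counts.values():
        total_counts.update(counts)

    # A: average number of words per cluster.
    avg_words = sum(total_counts.values()) / len(class_counts)

    keywords = {}
    for label, counts in class_counts.items():
        n_words = sum(counts.values())
        scores = {
            term: (freq / n_words) * math.log(1 + avg_words / total_counts[term])
            for term, freq in counts.items()
        }
        # Highest score first; ties broken alphabetically for determinism.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        keywords[label] = ranked[:top_n]
    return keywords


# Hypothetical documents with hand-assigned cluster labels (an assumption
# standing in for the embedding, UMAP, and HDBSCAN steps).
docs = [
    "the cat chased the mouse",
    "the cat slept",
    "the stock market fell",
    "the market rallied",
]
labels = [0, 0, 1, 1]

top_keywords = c_tf_idf(docs, labels, top_n=1)
```

With this toy input, the word "the" is the most frequent token in cluster 0, yet it is outranked by "cat" because it also appears in the other cluster. That down-weighting of terms shared across clusters is the point of the class-based variant.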

