TensorLearn
Back to Course
NLP Specialist: BERT & Beyond
Module 11 of 11

11. NLP Cheatsheet

Tokenizer

python
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased") tokens = tokenizer("Hello world", return_tensors="pt")

Regex

python
import re # Find emails re.findall(r'[\w\.-]+@[\w\.-]+', text)

Embeddings

python
model = SentenceTransformer('all-MiniLM-L6-v2') emb = model.encode("Hello world")

Mark as Completed

TensorLearn - AI Engineering for Professionals