flow

web scraping

with BeautifulSoup

text cleaning

with fuzzywuzzy

document embeddings

with doc2vec

machine segmentation

with topictiling

geospatial analysis

with carto-db