Seminar Series "Big Data and Natural Language Processing in the Social Sciences and Humanities" #4
Date: March 2, 2017
Venue: UTokyo, Komaba Campus, Building 2, Room 303
Nestor Alvaro (Data Scientist at BEEVA, Spain)
The DOM tree: How to navigate the contents from HTML webpages and RSS feeds
HTML is the type of document used to build webpages and is a widely extended format although not many people is aware of its internal structure. This workshop focuses on that internal structure, and also on the potential of the Document Object Model (DOM) for navigating documents.
The DOM tree allows us to explore webpages and other XML documents using their internal structure. This workshop will have a practical part where we will present a number of examples on how to use the DOM tree.
Nestor Alvaro is a Data Scientist at BEEVA (https://www.beeva.com/) researching on machine learning (ML) and natural language processing (NLP).
BEEVA is a technology consultancy firm created by the second largest bank in Spain: BBVA (https://en.wikipedia.org/wiki/Banco_Bilbao_Vizcaya_Argentaria).
Research > Seminars > Seminar Series "Big Data and Natural Language Processing in the Social Sciences & Humanities" >