rvest
This page contains several scraping exercises inspired, that is, copied, from https://www.r-bloggers.com/web-scraping-exercises/ also available in https://www.r-exercises.com/start-here-to-learn-r/.
Most of the code, however, has been changed because, as time goes by, the web content is modified and old code does not work anymore.
Remember
Consider the url ‘https://statbel.fgov.be/en/themes/indicators/prices/service-price-indices#panel-11’
Extract all the information load on table.
Consider the url ‘http://www2.sas.com/proceedings/sugi30/toc.html’
Extract all the papers names, from 001-30 to 268-30
HINT: Use selectorgagdget to see that selector cite
is
asso ciated with the paper titles.
Consider the url ‘http://www.gibbon.se/Retailer/Map.aspx?SectionId=832’
Extract all the options (Countries) availables on select button.
Consider the url ‘http://r-exercises.com/start-here-to-learn-r/’
Extract all the topics available on the url.
Consider the url ‘http://www.immobiliare.it/Roma/agenzie_immobiliari_provincia-Roma.html’
Extract all inmobiliaries names published on first page.
Consider the url=‘http://www.dictionary.com/browse/’ and the words ‘handy’,‘whisper’,‘lovely’,‘scrape’.
Build a data frame, where the first variables is “Word” and the second variables is “definitions”. Scrape the definitions from the url.
Write a script to find out which actor appears in higher number of Star War movies.
Hint: The idea is similar to the previous exercise but with a litlle more work you can