Web Scraping 101: Julia, HTTP, Gumbo & Cascadia | Tutorial 11 of 13 | Julia Web Tools for Beginners

In this coding tutorial, you’ll learn the basics of web scraping using Julia and VS Code.
You’ll learn how to combine the HTTP, Gumbo and Cascadia packages to create a flexible tool that will allow you to scrape both tabular and non-tabular data from the internet.
Once you’ve scraped the data, you’ll take it one step further and save it as a CSV file using Julia’s DelimitedFiles library.
This tutorial is intended for amateurs and hobbyists.
This tutorial is episode 11 of a 13-part series introducing Julia Web Tools for Beginners.
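
To give a rough idea of how the three packages fit together, here is a minimal sketch of the workflow described above. It is not the code from the video: the sample HTML, the `fetch_html` helper, and the output file name `table.csv` are all assumptions made up for illustration. The sketch parses an inline HTML string so that it runs without network access.

```julia
using HTTP, Gumbo, Cascadia, DelimitedFiles

# Hypothetical helper (not called below): download a page and return its HTML as a String.
fetch_html(url) = String(HTTP.get(url).body)

# Made-up sample table standing in for a downloaded page.
html = """
<table>
  <tr><th>Name</th><th>Age</th></tr>
  <tr><td>Rex</td><td>3</td></tr>
  <tr><td>Fido</td><td>5</td></tr>
</table>
"""

# Gumbo turns the HTML text into a tree of nodes.
doc = parsehtml(html)

# Cascadia finds nodes in that tree with CSS selectors.
rows = eachmatch(Selector("tr"), doc.root)

# Header cells (th) and data cells (td) of one row, as plain strings.
function row_cells(row)
    nodes = vcat(eachmatch(Selector("th"), row), eachmatch(Selector("td"), row))
    return [String(strip(nodeText(n))) for n in nodes]
end

cells = [row_cells(row) for row in rows]

# DelimitedFiles writes one line per row, with commas between the cells.
writedlm("table.csv", cells, ',')
```

Running this should leave a small `table.csv` in the current working directory. To scrape a real page instead, you could replace the inline string with `html = fetch_html(url)`, using one of the example URLs listed further down, and adjust the selectors to match that page.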

Schedule: New tutorials are posted on Sundays / Mondays.
Prerequisites: Julia, HTML & CSS.

00:00 Intro
01:03 Web Scraping
05:19 Programming Environment
06:39 Example 1: Scrape a simple HTML table and convert it into a CSV file
15:06 Example 2: Scrape a larger HTML table and convert it into a CSV file
20:23 Example 3: Scrape non-tabular data using Gumbo and convert it into a CSV file
25:29 Example 4: Scrape non-tabular data using Gumbo+Cascadia and convert it into a CSV file
28:32 Example 5: Scrape a Wikipedia table and convert it into a CSV file
39:52 Recap
41:04 Outro

Link to The Julia Programming Language YouTube Channel (JuliaCon 2021):
https://www.youtube.com/user/JuliaLanguage

Link to the GitHub repository for this tutorial:
https://github.com/julia4ta/tutorials/tree/master/Series%2003/Tutorial%2003x11

Links to websites used in the examples:
Example 1:
https://dabblingdoggo.github.io/mysite3/doggo/data.html
Example 2:
https://dabblingdoggo.github.io/mysite11/
Example 3:
https://dabblingdoggo.github.io/mysite3/doggo/about.html
Example 4:
https://dabblingdoggo.github.io/mysite3/doggo/about.html
Example 5:
https://en.wikipedia.org/wiki/ISO_3166-1

Links for packages used in this tutorial:
Cascadia:
https://github.com/Algocircle/Cascadia.jl
Gumbo:
https://github.com/JuliaWeb/Gumbo.jl
HTTP:
https://github.com/JuliaWeb/HTTP.jl

Link to the Julia Programming Language website:
https://julialang.org/
Link to the VS Code website:
https://code.visualstudio.com/

Links to My Videos and Playlists:
Julia Web Tools for Beginners Playlist:
https://www.youtube.com/watch?v=O15o1kdXAFo&list=PLhQ2JMBcfAsh5QGVR-DE6qJCGx7CC1OSh
Julia Analysis for Beginners Playlist:
https://www.youtube.com/watch?v=7M8e2Q5BirA&list=PLhQ2JMBcfAsjZTA8_jGhz3BVqYgOeyyeu
Julia for Beginners Playlist:
https://www.youtube.com/watch?v=0oChN11wf_4&list=PLhQ2JMBcfAsi_3g2AFJ6B84d8c5jw5kXp

Notice of Non-Affiliation and Disclaimer:
I am not affiliated, associated, authorized, endorsed by, or in any way officially connected with The Julia Programming Language, Julia Academy, Julia Computing, Microsoft, or any of their subsidiaries or their affiliates.
Nor am I affiliated, associated, authorized, endorsed by, or in any way officially connected with any packages or libraries used in this video.
All marks, emblems and images are registered trademarks of their respective owners. Use of them does not imply any affiliation with or endorsement by them.
