004 - HTML Scraping with Beautiful Soup
Content provided by eddyizm and octon. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by eddyizm and octon or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process described here: https://it.player.fm/legal.
Stream Our Mistakes EP 004
In this episode, Matt walks us through HTML/web scraping using the popular Python library Beautiful Soup.
Here's the code snippet from the session and links:
    # Created for Stream Our Mistakes
    # https://streamourmistakes.blogspot.com/
    # Reference:
    # https://docs.python.org/3/library/urllib.request.html
    # https://www.crummy.com/software/BeautifulSoup/bs4/doc/

    from bs4 import BeautifulSoup
    import urllib.request

    '''
    # local html to play with from documentation
    Uncomment to enable
    html_doc = """
links:
https://docs.python.org/3/library/urllib.request.html
https://www.crummy.com/software/BeautifulSoup/bs4/doc/
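For readers who want something to run right away, here is a minimal sketch of the kind of parsing the episode covers. It is not the code from the session: the sample markup, the `SAMPLE_HTML` name, and the `extract_links` helper are all made up for illustration, and it uses Python's built-in `html.parser` backend so nothing beyond `bs4` needs to be installed.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup for illustration; not the HTML used in the session.
SAMPLE_HTML = """
<html>
  <head><title>Sample page</title></head>
  <body>
    <p class="intro">Links mentioned in the episode:</p>
    <a href="https://docs.python.org/3/library/urllib.request.html">urllib.request</a>
    <a href="https://www.crummy.com/software/BeautifulSoup/bs4/doc/">Beautiful Soup</a>
  </body>
</html>
"""


def extract_links(html):
    """Return (text, href) pairs for every <a> tag that has an href attribute."""
    soup = BeautifulSoup(html, "html.parser")
    return [
        (a.get_text(strip=True), a["href"])
        for a in soup.find_all("a", href=True)
    ]


# Parse once to read the page title, then pull out the links.
soup = BeautifulSoup(SAMPLE_HTML, "html.parser")
page_title = soup.title.string
links = extract_links(SAMPLE_HTML)

print(page_title)
for text, href in links:
    print(text, href)
```

To try the same thing against a live page, one option is to read the response body with `urllib.request.urlopen(url).read()` and pass the result to `BeautifulSoup` in place of the sample string.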
Subscribe to the podcast on Apple Podcasts, Google Play, and Stitcher.
octon
site: http://octon.io/
github: https://github.com/mmdempsey
eddyizm
site: http://eddyizm.com
twitter: http://twitter.com/eddyizm
github: https://github.com/eddyizm
perry
github: https://github.com/apk29
---
**youtube live broadcast:**
https://youtube.com/user/eddyizm/live
Subscribe to our channel and follow my Twitter feed to be notified of our next live broadcast. Feel free to leave us comments and suggestions on what you want to see.