Korea Energy Economics Institute Digital Library

Book

Web Scraping with Python: Collecting More Data from the Modern Web

Edition
Second edition
Publication
Sebastopol, CA : O'REILLY, 2018
Physical description
xv, 288 p. : illustrations ; 24cm
Bibliographic note
Includes index
Holdings
Currently unavailable (1)
  • Registration number: E207060
  • Status: On loan
  • Due date: 2025.07.07
  • Location: Reading room
Book description

If programming is magic, then web scraping is surely a form of wizardry. By writing a simple automated program, you can query web servers, request data, and parse it to extract the information you need. The expanded edition of this practical book not only introduces you to web scraping, but also serves as a comprehensive guide to scraping almost every type of data from the modern web.

Part I focuses on web scraping mechanics: using Python to request information from a web server, performing basic handling of the server's response, and interacting with sites in an automated fashion. Part II explores a variety of more specific tools and applications to fit any web scraping scenario you're likely to encounter.

  • Parse complicated HTML pages
  • Develop crawlers with the Scrapy framework
  • Learn methods to store data you scrape
  • Read and extract data from documents
  • Clean and normalize badly formatted data
  • Read and write natural languages
  • Crawl through forms and logins
  • Scrape JavaScript and crawl through APIs
  • Use and write image-to-text software
  • Avoid scraping traps and bot blockers
  • Use scrapers to test your website
Table of contents

Preface

Part I Building Scrapers
  1. Your First Web Scraper
  2. Advanced HTML Parsing
  3. Writing Web Crawlers
  4. Web Crawling Models
  5. Scrapy
  6. Storing Data

Part II Advanced Scraping
  7. Reading Documents
  8. Cleaning Your Dirty Data
  9. Reading and Writing Natural Languages
  10. Crawling Through Forms and Logins
  11. Scraping JavaScript
  12. Crawling Through APIs
  13. Image Processing and Text Recognition
  14. Avoiding Scraping Traps
  15. Testing Your Website with Scrapers
  16. Web Crawling in Parallel
  17. Scraping Remotely
  18. The Legalities and Ethics of Web Scraping

Index