finscraper

The library provides an easy-to-use API for fetching data from various Finnish websites:

| Website | Type | Spider API class |
| --- | --- | --- |
| Ilta-Sanomat | News article | ISArticle |
| Iltalehti | News article | ILArticle |
| YLE Uutiset | News article | YLEArticle |
| Suomi24 | Discussion thread | Suomi24Page |
| Muusikoiden.net | Discussion thread | MNetPage |
| Vauva | Discussion thread | VauvaPage |
| Oikotie Asunnot | Apartment ad | OikotieApartment |
| Tori | Item deal | ToriDeal |

Documentation is available at https://finscraper.readthedocs.io. A simple online demo is also available.

Installation

pip install finscraper

Quickstart

Fetch 10 news articles as a pandas DataFrame from Ilta-Sanomat:

from finscraper.spiders import ISArticle

spider = ISArticle().scrape(10)

articles = spider.get()

The API is similar for all the spiders:

[Animation: finscraper in action]
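Since every spider follows the same `scrape` then `get` pattern shown in the Quickstart, one small helper can serve all of them. The sketch below is only an illustration: `fetch_items` is a hypothetical name, not part of finscraper, and it assumes that each spider class can be constructed without arguments, as `ISArticle()` is above.

```python
from typing import Any, Type


def fetch_items(spider_cls: Type[Any], n: int = 10) -> Any:
    """Scrape about `n` items with any spider class and return the result.

    Hypothetical helper: it relies only on the two calls shown in the
    Quickstart, `scrape(n)` (which returns the spider) and `get()`.
    """
    spider = spider_cls().scrape(n)
    return spider.get()


# Usage with real spiders (requires finscraper and network access):
#   from finscraper.spiders import YLEArticle, Suomi24Page
#   articles = fetch_items(YLEArticle, 10)   # news articles
#   threads = fetch_items(Suomi24Page, 5)    # discussion threads
```

Passing the class rather than an instance keeps each call independent, so results from different websites do not share spider state.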

Contributing

Please see CONTRIBUTING.md for more information.


Jesse Myrberg (jesse.myrberg@gmail.com)