SE::Seznam - Scraper for the Czech search engine seznam.cz

Overview of the scraper

The Seznam search results scraper. With it, you can collect large databases of links ready for further use. You can enter queries in the same form as you would type them into the Seznam search bar, including search operators (site, inurl, etc.).

A-Parser functionality allows you to save Seznam scraper settings for future use (presets), set up a scraping schedule, and much more. You can use automatic query multiplication, substitution of subqueries from files, and iteration over alphanumeric combinations and lists to get the maximum possible number of results.

The results can be saved in the form and structure you need thanks to the powerful built-in templating engine, Template Toolkit, which allows you to apply additional logic to the results and output data in various formats, including JSON, SQL, and CSV.

Collected data

  • Links, anchors, and snippets from search results
  • List of related keywords

Capabilities

  • Scrapes the maximum number of results provided by Seznam: up to 50 pages with 20 results per page
  • Total maximum number of results per query: 1000

Use cases

  • Collecting link databases - for A-Poster, XRumer, AllSubmitter, etc.
  • Searching for backlinks (mentions) of sites
  • Searching for vulnerable websites
  • Any other uses involving scraping Seznam in one form or another

Queries

As queries, you should specify search phrases, just as if you were entering them directly into the Seznam search form, for example:

test query
okna Moskva
site:a-parser.com
inurl:auto

Query substitutions

You can use built-in macros to multiply queries. For example, to build a very large database of forums, specify a few main queries in different languages:

forum
forum
foro
论坛

In the query format, we specify iteration over character combinations from a to zzzz. This method rotates the search results as much as possible and yields many new unique results:

$query {az:a:zzzz}

This macro will create 475254 additional queries for each initial search query, resulting in a total of 4 x 475254 = 1901016 search queries. The number is impressive, but this is not a problem at all for A-Parser. With a speed of 2000 queries per minute, such a task will be processed in just 16 hours.
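
The figures above can be checked with a short calculation. The following is a Python sketch, not A-Parser code. It assumes that the {az:a:zzzz} macro enumerates every lowercase Latin string of length 1 to 4 and appends it to the query after a space; the function names az_suffixes and expand are hypothetical.

```python
from itertools import islice, product
from string import ascii_lowercase


def az_suffixes(max_len):
    """Yield 'a', 'b', ..., 'z', 'aa', ... up to max_len letters."""
    for length in range(1, max_len + 1):
        for letters in product(ascii_lowercase, repeat=length):
            yield "".join(letters)


def expand(query, max_len=4):
    """Yield '<query> <suffix>' for each suffix, mimicking '$query {az:a:zzzz}'."""
    for suffix in az_suffixes(max_len):
        yield f"{query} {suffix}"


per_query = sum(26 ** n for n in range(1, 5))  # 26 + 676 + 17576 + 456976
total = 4 * per_query                          # four initial queries
hours = total / 2000 / 60                      # at 2000 queries per minute

print(per_query, total, round(hours, 1))  # 475254 1901016 15.8
print(list(islice(expand("forum"), 3)))   # ['forum a', 'forum b', 'forum c']
```

The generators are lazy, so the full list of almost two million queries is never held in memory at once.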

Using operators

You can use search operators in the query format, so that they are automatically added to every query in your list:

site:$query
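
To illustrate the effect of such a format, here is a minimal Python sketch. It is not how A-Parser works internally; it uses Python's standard string.Template, whose $name placeholder syntax happens to resemble the query format, and the sample domains are illustrative.

```python
from string import Template

# Hypothetical stand-in for the A-Parser query format field.
query_format = Template("site:$query")

# Each input query is substituted into the format in turn.
queries = ["a-parser.com", "seznam.cz"]
expanded = [query_format.substitute(query=q) for q in queries]

print(expanded)  # ['site:a-parser.com', 'site:seznam.cz']
```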

Output results examples

A-Parser supports flexible results formatting thanks to the built-in templating engine Template Toolkit, which allows it to output results in any desired form, as well as in a structured format, for example, CSV or JSON.

Exporting a list of links

Analogous to SE::Google.

Saving in SQL format

Analogous to SE::Google.

Dump results to JSON

Analogous to SE::Google.

Results processing

A-Parser allows processing results directly during scraping. This section covers the most popular use cases for the Seznam scraper.

Analogous to SE::Google.

Extracting domains

Analogous to SE::Google.

Removing tags from anchors and snippets

Analogous to SE::Google.

Possible settings

Parameter Name | Default Value | Description
Pages count | 5 | Number of pages to scrape (from 1 to 50)
Links per page | 10 | Number of links per page (10 / 20)
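
As a rough guide to how these two settings bound the number of results per query, here is a small Python sketch. The function max_results is hypothetical and not part of A-Parser; it only multiplies the two settings after checking them against the ranges listed above.

```python
def max_results(pages_count=5, links_per_page=10):
    """Upper bound on results per query for the given settings."""
    if not 1 <= pages_count <= 50:
        raise ValueError("Pages count must be between 1 and 50")
    if links_per_page not in (10, 20):
        raise ValueError("Links per page must be 10 or 20")
    return pages_count * links_per_page


print(max_results())        # 50 with the default settings
print(max_results(50, 20))  # 1000, the stated maximum per query
```

Seznam may return fewer results than this bound for a given query.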