Skip to main content

SE::Rambler - Rambler search results scraper

img

Overview of the scraper

This Rambler search results scraper. allows you to obtain large databases of links ready for further use. You can use queries in the same format as you enter them into the Rambler search bar, including search operators (site, ip etc.).

A-Parser's functionality allows you to save Rambler scraper parsing settings for future use (presets), ), set up a parsing schedule, and much more. You can use automatic query multiplication, substitution of sub-queries from files, iteration over alphanumeric combinations and lists to get the maximum possible number of results.

Results can be saved in the format and structure you need, thanks to the built-in powerful templating engine Template Toolkit which allows applying additional logic to the results and outputting data in various formats, including JSON, SQL and CSV.

Collected Data

  • Number of results in the search output
  • Links, anchors, and snippets from the output
  • List of related keywords (hints)

what data the SE::Rambler scraper collects

Capabilities

  • Supports Rambler search operators (url:, site:, inurl:, host:, rhost:, domain:.)
  • Scrapes up to 25 pages, with 10 to 50 results per page
  • Scrapes related keywords ($hints)
  • Option to use captcha-solving services
  • Select the result device: regular desktop, mobile Android, or mobile iOS

Use Cases

  • Collecting link databases
  • Assessing competition for keywords
  • Finding backlinks (mentions) for websites
  • All cases where you need to scrape Rambler search results

Queries

Enter queries as you would in Rambler search. For example, if you need only links from one site. Enter the following in the query field:

"buy doors" site:http://kp.ru

Query substitutions

You can use built-in macros to multiply queries. For example, if we want to get a very large database of forums, we can specify several main queries in different languages:

forum
forum
foro
论坛

In the query format, we specify iterating through characters from a to zzzz, . This method maximizes search result rotation and yields many new unique results:

$query {az:a:zzzz}

This macro will create 475254 additional queries for each initial search query, which totals 4 x 475254 = 1901016 search queries. This impressive number is no problem for A-Parser. At a rate of 2000 queries per minute, this task will be processed in just 16 hours.

Using operators

You can use search operators in the query format, so it will be automatically added to every query in your list:

site:$query

Output Results Examples

A-Parser supports flexible result formatting thanks to the built-in templating engine Template Toolkit, which allows it to output results in any desired form, as well as in structured formats like CSV or JSON

Exporting link list

Similar to SE::Google.

Similar to SE::Google.

Similar to SE::Google.

Result format:

$hints.format('$hint\n')

Result example:

habrahabr
habr
habrahabr ru
xabra
livebusiness
eureka
electronic accountant
elba island
elba electronic accountant
habrohabr
...

Saving in SQL format

Similar to SE::Google.

Dump results to JSON

Similar to SE::Google.

Results Processing

A-Parser allows results to be processed directly during scraping. In this section, we have listed the most popular use cases for the Rambler scraper

Similar to SE::Google.

Similar to SE::Google.

Extracting domains

Similar to SE::Google.

Removing tags from anchors and snippets

Similar to SE::Google.

Similar to SE::Google.

Available Settings

Parameter NameDefault ValueDescription
DeviceDesktopSelect the result device: regular desktop, mobile Android, or mobile iOS
Pages count5Number of pages to scrape (from 1 to 25)
Links per page10Number of results per page (10/15/30/50)
Rambler region IDOption to specify region. Requires region ID. How to find the required ID is described here
SortSites by relevanceSelect the result sorting option
Results filteringModerateSelect the result filtering option
Results languageAny languageSelect the search language for results
Serp timeAnytimeSelect the result period
Results typeAny formatSelect the result type (mime type)
Exact matchStrict query matching
Disable autocorrectDisables autocorrection, allowing scraping the output exactly for the specified query
Use sessionsSaves good sessions, allowing faster scraping with fewer errors
AntiGate presetdefaultDetermines whether to use Util::AntiGateUtil::AntiGate to bypass captchas