Skip to main content

SE::Google::TrustCheck - Website Trust Check

Scraper Overview

This scraper allows checking a site's Trust Rank in Google. All features of the SE::GoogleSE::Google. #CODEKEEP_L_20#

Overview: speed of operation

#CODEKEEP_L_25#

Results can be saved in the form and structure you need, thanks to the powerful built-in templating engine Template Toolkit which allows applying additional logic to the results and outputting data in various formats, including JSON, SQL and CSV.

Collected data

  • Check for Google's Trust (trustworthiness) in a site
  • Possible results - 0, 1 and 2:
    • 0 - no additional link blocks
    • 1 or 2 means that Google trusts the site, as it displays additional link blocks. 1 - means that the site has a horizontal link block, and 2 - means it has a large vertical link block

what data the SE::Google::TrustCheck scraper collects what data the SE::Google::TrustCheck scraper collects

Capabilities

  • Collecting a database of trusted sites
  • Supports selection of search country, domain, results language, and other settings

Queries

As queries, you must specify the URL of the site being searched, for example:

http://uraldekor.ru/  
http://a-parser.com/
http://www.yandex.ru/
http://google.com/
http://vk.com/
http://facebook.com/
http://youtube.com/

Query substitutions

You can use built-in macros to automatically substitute subqueries from files. For example, if we want to check sites/a site against a database of keys, we specify a few main queries:

ria.ru
lenta.ru
rbc.ru
yandex.ru

In the query format, we specify a macro for substituting additional words from the file Keywords.txt, . This method allows checking a site database against a keyword database and obtaining positions in the result:

$query {subs:Keywords}

This macro will create as many additional queries as there are in the file for each source search query, totaling [number of source queries (domains)] x [number of queries in the Keywords file] = [total number of queries] as a result of the macro execution.

You can also specify the protocol in the query format, so that only domains can be used as queries:

http://$query 

This format will prepend http:// to each query.

Examples of result output

A-Parser supports flexible result formatting thanks to the built-in templating engine Template Toolkit, which allows it to output results in any form, as well as in structured formats such as CSV or JSON

Exporting the trust check list

Result format:

$query: $trustrank\n

The result displays a list of links and their Trust check.

Example result:

http://www.yandex.ru/: 2
http://a-parser.com/: 1
http://vk.com/: 2
http://uraldekor.ru/: 0
http://google.com/: 2
...

Same as in SE::Google.

Same as in SE::Google.

Same as in SE::Google.

Keyword competition

Same as in SE::Google.

Checking link indexation

Same as in SE::Google.

Saving in SQL format

Same as in SE::Google.

Dump results to JSON

Same as in SE::Google.

Results processing

A-Parser allows results to be processed directly during parsing. In this section, we have provided the most popular use cases for the SE::Google::TrustCheck scraper

Saving domains with trust value "1"

Add a filter and select the trust variable $trustrank - Trust rank from the dropdown list. Select the type: String equals. Next, in String, enter the trust value we need, for example 1. With this filter, you can remove all results with unnecessary trust.

Example of filtering
Download example

How to import the example into A-Parser

eJx1VEtz2jAQ/iuMJod2hjhw6MU3woROOzSkCTkBB4HXRLWsNXrwGA//vSvZ2CYt
J3tf37f7raSSWW4y86LBgDUsXpSsCP8sZm9PcfwdcSshjufaGTv+gE3Wu+8FQ3OV
9VIhLWjWZwXXBrQHWNyoo6QEUu6kZf2S2VMBRIF70FokQEGRkJ2izrmlBkIa23Pp
fNrdzoE+xb07e2FeLhU738YpNB5PGqwWYDo4w8Fg0C1r2q8ZY9YQkLOKzqtU2Dku
u1D0j4UVqMgwoAw7r1YXIDMJc/jOi2FUC9oE3/ge5ljRQ+uekPXM8zBvwi34aFQJ
8uVrZI8egSeJ8JxcVgxe8Jb1XYldaE4h5XrNaPyJxtwPBgEgCHnpbsE+rC3ih4dK
X0ZILkD8rkpZnHJpoM8MdTzh1E/yOSJIIW5Rz4IU5C8ZqpGUU9iDbNMC/qMTMqFD
Mkqp6Edd+P+U2T8Y52bKLhWt/aCphwYlWI+zX21VglPckgDJmsaXIheWbDNGp/x+
BuTMAIpGumcvXY4aGpoauWanm1KA8mes3dyoaF1XY1xt59q5QZWK7aw+t5dMp+Z0
HWdqjHkhwc+lnJS0FgOv7SkZmXoN3mgb/Fw8DhR+9Mu9YxZRmp9vVauFFnQKv/kG
c1Kyy1pDbriU76/TboS1J4sMpznNlKGOtFsqfl89A9EG86U6HA7RiasEjiG4DS9C
Fdpn1TflG1gj1tYJnXXrkMI8t4Ut0qkkZc6r5oVpXqry1jsTl2da+x/zUhV4jXw6
+UhsEy7s8PwXfU7A2Q==
tip

See also: Results Filters

Same as in SE::Google.

Same as in SE::Google.

Extracting domains

Same as in SE::Google.

Removing tags from anchors and snippets

Same as in SE::Google.

Same as in SE::Google.

Possible settings

Supports all settings of the SE::GoogleSE::Google, scraper, plus additional settings:

Name of parameterDefault valueDescription
Pages count1Number of parsing pages in the search results (from 1 to 10)