SE::Baidu - Baidu search engine results parser

Overview of the parser

Baidu search results parser. With the Baidu parser, you can collect very large databases of links ready for further use. You can use queries in the same form as you enter them in the Baidu search bar, including search operators (filetype, site, intitle).

A-Parser functionality allows you to save Baidu parser settings for future use (presets), set a parsing schedule, and much more. You can use automatic query multiplication, substitution of subqueries from files, brute-forcing of alphanumeric combinations and lists to obtain the maximum possible number of results.

The Baidu parser can save results in whatever form and structure you need, thanks to the powerful built-in Template Toolkit template engine, which lets you apply additional logic to the results and output data in various formats, including JSON, SQL, and CSV.

Use cases for the parser

Collected data

  • Links
  • Snippets
  • Anchors
  • Total number of results
  • List of related words
  • Number of search result pages
Capabilities

  • Parses up to 5000 results per query
  • Supports all Baidu search operators (filetype:, site:, intitle:)
  • Collects search results for a query and related keywords
  • Converts shortened links to full ones (Get full links option)

Use cases

  • Collecting link databases - for A-Poster, XRumer, AllSubmitter, etc.
  • Assessing competition for keywords
  • Checking website indexing
  • Collecting pages that contain specified keywords in the page title

Queries

You should specify search phrases as queries, for example:

test
site:www.baidu.com
百度产品大全
intitle:parser

Query substitutions

You can use built-in macros for query multiplication; for example, if we want to get a very large database of forums, we specify several main queries in different languages:

forum
forum
foro
论坛

In the query format, we specify a character brute-force from a to zzzz. This rotates the search results as much as possible and yields many new unique results:

$query {az:a:zzzz}

This macro will create 475254 additional queries for each original search query, which in total gives 4 x 475254 = 1901016 search queries. That is an impressive figure, but not a problem for A-Parser: at a speed of 2000 queries per minute, such a task will be processed in about 16 hours.
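The figures above can be checked with a short Python sketch. It is not A-Parser code: the helper names `az_range` and `expand` are hypothetical, and the generation order (shortest strings first, alphabetical within each length) is an assumption about how the `{az:a:zzzz}` macro enumerates its values.

```python
from itertools import product
from string import ascii_lowercase


def az_range(min_len: int, max_len: int):
    """Yield every lowercase string with length in [min_len, max_len], shortest first."""
    for length in range(min_len, max_len + 1):
        for letters in product(ascii_lowercase, repeat=length):
            yield "".join(letters)


def expand(queries, min_len=1, max_len=4):
    """Append each generated suffix to each query, imitating `$query {az:a:zzzz}`."""
    for query in queries:
        for suffix in az_range(min_len, max_len):
            yield f"{query} {suffix}"


# Strings of length 1 to 4 over 26 letters: 26 + 676 + 17576 + 456976
per_query = sum(26 ** n for n in range(1, 5))
total = 4 * per_query          # four original queries
hours = total / 2000 / 60      # at 2000 queries per minute

print(per_query, total, round(hours, 1))  # 475254 1901016 15.8
```

`expand` is a lazy generator, so the full list of almost two million queries is never held in memory at once.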

Using operators

You can use search operators in the query format, so they will be automatically added to each query from your list:

site:$query
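As a rough illustration of what the query format does, the following Python sketch replaces the `$query` placeholder with each query from a list. The function `apply_format` is a hypothetical name, and plain string replacement is an assumption; A-Parser's own substitution logic may handle more cases.

```python
def apply_format(query_format: str, queries):
    """Replace the `$query` placeholder with each query in turn."""
    return [query_format.replace("$query", q) for q in queries]


formatted = apply_format("site:$query", ["www.baidu.com", "example.com"])
print(formatted)  # ['site:www.baidu.com', 'site:example.com']
```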

Output results examples

A-Parser supports flexible result formatting thanks to the built-in Template Toolkit template engine, which allows it to output results in arbitrary forms, as well as structured ones like CSV or JSON.

Exporting a list of links

Same as in SE::Google.

Keyword competition

Same as in SE::Google.

Checking link indexing

Same as in SE::Google.

Saving in SQL format

Same as in SE::Google.

Dumping results to JSON

Same as in SE::Google.

Results processing

A-Parser allows you to process results directly during parsing; in this section, we have provided the most popular cases for the Baidu parser.

Same as in SE::Google.

Extracting domains

Same as in SE::Google.

Removing tags from anchors and snippets

Same as in SE::Google.

Possible settings

Parameter name | Default value | Description
Pages count    | 5             | Number of pages to parse (from 1 to 100)
Links per page | 50            | Number of links in search results per page (10 / 20 / 50)
Get full links | Disabled      | Conversion of shortened links to full ones
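
The Pages count and Links per page settings together bound how many results a single query can return. The sketch below is only an illustration of that arithmetic: `max_results` is a hypothetical helper, not part of A-Parser, and the value it returns is an upper bound, since Baidu may return fewer results for a given query.

```python
def max_results(pages_count: int, links_per_page: int) -> int:
    """Upper bound on results per query for the given settings."""
    if not 1 <= pages_count <= 100:
        raise ValueError("Pages count must be between 1 and 100")
    if links_per_page not in (10, 20, 50):
        raise ValueError("Links per page must be 10, 20, or 50")
    return pages_count * links_per_page


print(max_results(5, 50))    # defaults: 250
print(max_results(100, 50))  # maximum settings: 5000
```

With the maximum settings, the bound matches the limit of 5000 results per query listed under Capabilities.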