SE::Baidu - Baidu search engine results parser

Overview of the parser
The Baidu parser collects Baidu search results and lets you build large databases of links ready for further use. You can write queries exactly as you would enter them in the Baidu search bar, including search operators (filetype:, site:, intitle:).
A-Parser lets you save Baidu parser settings for future use (presets), set a parsing schedule, and much more. To obtain the maximum possible number of results, you can use automatic query multiplication, substitution of subqueries from files, and enumeration of alphanumeric combinations and lists.
Results can be saved in the form and structure you need: the built-in Template Toolkit template engine lets you apply additional logic to the results and output data in various formats, including JSON, SQL, and CSV.
Use cases for the parser
- Parsing full Baidu links: shows how to parse full links
- Baidu Suggestions: multi-level parsing of Baidu suggestions
- JS parser JS::SE::Baidu::Suggest: creating JS parsers, getting Baidu suggestions
Collected data
- Links
- Snippets
- Anchors
- Total number of results
- List of related words
- Number of search result pages

Capabilities
- Parses up to 5000 results per query
- Supports all Baidu search operators (filetype:, site:, intitle:)
- Collects search results for a query and related keywords
- Converts shortened links to full ones (Get full links option)
Use cases
- Collecting link databases - for A-Poster, XRumer, AllSubmitter, etc.
- Assessing competition for keywords
- Checking website indexing
- Collecting pages that contain specified keywords in the page title
Queries
Specify search phrases as queries, one per line, for example:
test
site:www.baidu.com
百度产品大全
intitle:parser
Query substitutions
You can use built-in macros for query multiplication; for example, if we want to get a very large database of forums, we specify several main queries in different languages:
forum
forum
foro
论坛
In the query format, we append a character enumeration from a to zzzz to each query; this maximizes the rotation of search results and yields many new unique results:
$query {az:a:zzzz}
This macro creates 475254 additional queries for each original search query (26 + 26^2 + 26^3 + 26^4 combinations of one to four letters), which gives 4 x 475254 = 1901016 search queries in total. That is an impressive figure, but not a problem for A-Parser: at a speed of 2000 queries per minute, such a task is processed in about 16 hours.
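The arithmetic can be checked with a short Python sketch. It is not A-Parser code: the helper names `az_range_count` and `expand_queries` are hypothetical, and the sketch assumes the macro enumerates every lowercase Latin string from one to four letters and appends it to the query after a space.

```python
from itertools import product
from string import ascii_lowercase


def az_range_count(min_len: int, max_len: int, alphabet_size: int = 26) -> int:
    """Count all strings over the alphabet with lengths min_len..max_len."""
    return sum(alphabet_size ** n for n in range(min_len, max_len + 1))


def expand_queries(query: str, max_len: int):
    """Yield 'query suffix' for every lowercase suffix of 1..max_len letters.

    Assumption: this mirrors what '$query {az:a:zzzz}' produces for max_len=4.
    """
    for n in range(1, max_len + 1):
        for letters in product(ascii_lowercase, repeat=n):
            yield f"{query} {''.join(letters)}"


per_query = az_range_count(1, 4)   # 26 + 676 + 17576 + 456976
total = 4 * per_query              # four seed queries
hours = total / 2000 / 60          # at 2000 queries per minute

print(per_query, total, round(hours, 1))  # prints: 475254 1901016 15.8
```

The generator is lazy on purpose: materializing almost two million query strings at once is unnecessary when they can be produced one at a time.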
Using operators
You can use search operators in the query format, so they will be automatically added to each query from your list:
site:$query
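To illustrate what this substitution does, here is a minimal Python sketch that mimics the `$query` placeholder with the standard `string.Template` class. The function name `apply_query_format` is hypothetical, and the sketch only models the substitution step, not how A-Parser implements it.

```python
from string import Template


def apply_query_format(query_format: str, queries: list[str]) -> list[str]:
    """Substitute each query into the $query placeholder of the format."""
    template = Template(query_format)
    return [template.substitute(query=q) for q in queries]


formatted = apply_query_format("site:$query", ["www.baidu.com", "example.com"])
print(formatted)  # prints: ['site:www.baidu.com', 'site:example.com']
```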
Output results examples
A-Parser supports flexible result formatting thanks to the built-in Template Toolkit template engine, which allows it to output results in arbitrary forms, as well as structured ones like CSV or JSON.
- Exporting a list of links
- Links + anchors + snippets with position output
- Outputting links, anchors, and snippets to a CSV table
- Saving related keywords
- Keyword competition
- Checking link indexing
- Saving in SQL format
- Dumping results to JSON
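As a rough picture of what the CSV and JSON outputs contain, the following Python sketch serializes a few made-up result records. The field names `link`, `anchor`, and `snippet` and the sample data are assumptions for illustration; in A-Parser itself this formatting is done with Template Toolkit templates, not Python.

```python
import csv
import io
import json

# Hypothetical sample results; real data would come from the parser.
results = [
    {"link": "https://example.com/a", "anchor": "Example A",
     "snippet": "First, with a comma"},
    {"link": "https://example.org/b", "anchor": "Example B",
     "snippet": "Second"},
]


def to_csv(rows) -> str:
    """Render rows as CSV with a 1-based position column."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["position", "link", "anchor", "snippet"]
    )
    writer.writeheader()
    for position, row in enumerate(rows, start=1):
        writer.writerow({"position": position, **row})
    return buffer.getvalue()


def to_json(rows) -> str:
    """Render rows as a JSON array, keeping non-ASCII text readable."""
    return json.dumps(rows, ensure_ascii=False, indent=2)


print(to_csv(results))
print(to_json(results))
```

Using a CSV writer rather than joining fields by hand matters here because snippets often contain commas and quotes, which must be escaped.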
Results processing
A-Parser allows you to process results directly during parsing; this section lists the most popular cases for the Baidu parser.
- Link deduplication
- Link deduplication by domain
- Extracting domains
- Removing tags from anchors and snippets
- Filtering links by inclusion
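The processing steps listed above can be sketched in Python to show what each one does to the data. All function names are hypothetical, and this is an illustration of the operations rather than A-Parser's own implementation, which applies them through its result-processing settings.

```python
import re
from urllib.parse import urlsplit


def domain_of(link: str) -> str:
    """Extract the lowercased host part of a link."""
    return urlsplit(link).netloc.lower()


def dedupe_by_domain(links):
    """Keep only the first link seen for each domain, preserving order."""
    seen = set()
    unique = []
    for link in links:
        domain = domain_of(link)
        if domain not in seen:
            seen.add(domain)
            unique.append(link)
    return unique


def strip_tags(text: str) -> str:
    """Remove HTML tags such as <em> from an anchor or snippet."""
    return re.sub(r"<[^>]+>", "", text)


def filter_by_inclusion(links, needle: str):
    """Keep only links that contain the given substring."""
    return [link for link in links if needle in link]


links = [
    "https://Example.com/forum/1",
    "https://example.com/forum/2",
    "https://example.org/blog",
]
print(dedupe_by_domain(links))
print(strip_tags("A <em>parser</em> for Baidu"))
print(filter_by_inclusion(links, "/forum/"))
```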
Possible settings
| Parameter name | Default value | Description |
|---|---|---|
| Pages count | 5 | Number of pages to parse (from 1 to 100) |
| Links per page | 50 | Number of links in search results per page (10 / 20 / 50) |
| Get full links | ☐ | Conversion of shortened links to full ones (disabled by default) |
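
The relationship between these settings and the limit of 5000 results per query can be expressed as a small Python sketch. The class `BaiduSettings` and its field names are hypothetical stand-ins for the parameters in the table, not an A-Parser API.

```python
from dataclasses import dataclass


@dataclass
class BaiduSettings:
    """Hypothetical model of the settings table above."""
    pages_count: int = 5         # allowed range: 1 to 100
    links_per_page: int = 50     # allowed values: 10, 20, 50
    get_full_links: bool = False

    def __post_init__(self):
        if not 1 <= self.pages_count <= 100:
            raise ValueError("pages_count must be between 1 and 100")
        if self.links_per_page not in (10, 20, 50):
            raise ValueError("links_per_page must be 10, 20, or 50")

    def max_results(self) -> int:
        """Upper bound on results collected for a single query."""
        return self.pages_count * self.links_per_page


print(BaiduSettings().max_results())                 # prints: 250
print(BaiduSettings(pages_count=100).max_results())  # prints: 5000
```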