Reddit::PostInfo - Scraper for Reddit Post Information

SE::Quora

Overview of the Reddit::PostInfo scraper

Reddit::PostInfo - scraper for post information on Reddit.

Collects information about the post, including comments.

You can use automatic query replication, substitution of subqueries from files, iteration through alphanumeric combinations and lists to obtain the maximum possible number of results.

A-Parser's functionality allows you to save the scraping settings for the Reddit::PostInfo scraper for future use (presets), set a scraping schedule, and much more.

Results can be saved in the format and structure you need, thanks to the built-in powerful templating engine Template Toolkit which allows you to apply additional logic to the results and output data in various formats, including JSON, SQL and CSV.

Go to DEMO Buy A-Parser Pro ($299)

Data collected

Link to the post
Title and flair
Score, number of comments, and number of awards
Creation date
Community where the post was published
Author and their flair
Post content: text in markdown, link to media content, and link to an external resource
Whether the post is sponsored

Array of comments:

ID
Parent ID
Link
Author
Text (cleaned of tags)
Text (with tags)

Capabilities

Ability to limit the number of comments to scrape

Queries

One query type is supported:

Links to posts

Example:

https://www.reddit.com/r/Audi/comments/151atr5/audi_r8_high_speed_crash_294_km/
https://www.reddit.com/r/Lexus/comments/1dc7r2m/anyone_come_from_audi_to_lexus/

By default, the result will output information about the post without comments

Output options

A-Parser supports flexible result formatting thanks to the built-in templating engine Template Toolkit, which allows it to output results in an arbitrary form, as well as in a structured format, such as CSV or JSON.

Available settings

note

General settings for all scrapers

Parameter	Default value	Description
Max comments count	50	Number of comments to scrape

Overview of the Reddit::PostInfo scraper​

Data collected​

Capabilities​

Queries​

Links to posts​

Output options​

Available settings​