-
Join our Telegram chat: https://t.me/a_parser_enDismiss Notice
HTML::EmailExtractor - parsing of e-mail addresses from site pages
-
Collected data(top)
- Collect email addresses from specified pages
Opportunity(top)
- Going to the internal pages of the site to the specified depth (option Parse to level) - allows you to walk through all pages of the site, collecting internal and external links
- Determining the site encoding
- Definition of links for links
- Ability to specify subdomains as internal pages of a site
Use options(top)
- Used to collect e-mail addresses that are placed on site pages
- Ability to output the number of e-mail addresses hosted on the site through the variable $ mailcount
Requests(top)
In the request it is necessary to specify the url from which you want to scrape the e-mail address(es)
Option Parse to level(top)
Specifies to a parser to pass according to adjacent pages of the site into depth to the specified level, for example:
- If the 1st level that is specified the parser will follow all links specified on the initial page
- If the 2nd level that is specified the parser will follow all links specified on the initial page according to all links collected from pages at the first level
- etc.
Results(top)
- The result is the e-mail address (es) of the site
Possible settings(top)
Global settings for all parsers
Misery713 likes this.