SE::Bing::Images - Bing Search Engine Image Scraper
Scraper Overview
SE::Bing::Images scrapes image results from the Bing search engine. It lets you collect databases of image links, or the images themselves, ready for further use. Queries are entered the same way as in the Bing search bar.
The A-Parser functionality allows you to save the parsing settings of the Bing scraper for further use (presets), set up a parsing schedule, and much more. You can use automatic query replication, substitution of subqueries from files, enumeration of alphanumeric combinations and lists to obtain the maximum possible number of results.
Results can be saved in whatever form and structure you need thanks to the built-in Template Toolkit templating engine, which lets you apply additional logic to the results and output data in various formats, including JSON, SQL, and CSV.
Scraper Use Cases
Downloading Images by Link
A-Parser lets you chain tasks: when the first task completes, the second one starts and uses the links collected by the first as its queries.
Download Example
How to import the example into A-Parser
eJyNVUtT2zAQ/iuMhkNoQ2IOvfjCBNpM6VBCIZxCOqPGa1cgS0aSA4zJf++ubPwI
JvQmrfa9334qmOP23l4asOAsCxcFy/yZhexEqGTvLOUJ7H3Vj0pqHoFhQ5ZxY8GQ
8oJdfwtD0gtDr2jxOYKY59Kx5XLI0BUe7VSblJPL/exoVPmvH6/5GuYaH2MhoRFP
8XbBUyCriDug11HsHQ0ORu6JPPAoEk5oxWUZgXJqot4o8ZCTvXUGU0R9vBoBdmp0
imIH3gkJn18zXLB9f2foJvf2v0obFsZcWhgyi+lOOSYTbb8IB4Y7bWYZ5YTygmk1
kfIc1iAbNe//JBcSu2knMRqdVYb9KrM3PjZ1ie1QazCPBnOovfjbyexnYxXpc51g
5dEfrFuKVDi821OdKxpOgMJ7gKzu24VGSaoN1GGcyaEOjqDJQEWo2ExtkjWiThWd
yXSFK61ikcwwfyMieNXM1RyROVOnOs0kUFmshFgXjLmFqwYwE1sNhS51utuuTn1A
6kMF1SFzWkv747pMPDMC8fiF0k2xre0cqtauuJQ3V+ed7Bp8kWedwEoLxUjXQaIR
UljXZlh01ucCXBh+n88vW3uDKgYSeEIv2BkHyh265wzC0adjQQ0YDzKVvNxlcJy8
JCI+8M1H/TkqoY2gvTCGP1fLQ8WVL6vcOp2W3aonhvK/wP0wXkWVo1jy1pbSatC+
+BLWXHqAKK2g8Y8b6vxMtIcLLR4o245ULlypV0VR2sFDziXbtPmi2X1fsh2XWzlS
eXq4X1RnVNiMiFFKtz2IUbmU7wC9D8O99PA/6NkJwqDZO6U/JK1tNnwHaW1Q9dMs
cibvWcKCWZ2bFbkpiY6wT8OldrLlsMbeeLD4PV5+Pri9HQ2Owy7k9nswV4GgNN8s
h80/0re2PXSxRbdBHxnVm9nDwkF7/7ZXusNiwTtEtP1v+HF9xOrBbkbffu6webDp
45hgx5/Uz5W7PoOg/RFQQD8D7PmRH1JJRvX/X7z90cMCjRqiwive7+xlaUkYr3Qw
pvV7f7T5B3ZQ48A=
Collected Data
- Image links
- Image width and height
- Image formats
- Link to the page with the image
- Anchors
- Image preview
Capabilities
- Specifying the number of pages
- Choosing the interface language and the region for search results
- Choosing safe search options
Use Cases
- Collecting images to fill your blogs and websites
- Collecting avatar databases
Queries
Queries should be specified as search phrases, for example:
dogecoin
Football
Waterfall
Speak in english
Cats and dogs
cars
Query Substitutions
You can use built-in macros to expand queries. For example, to build a very large database of forums, specify several main queries in different languages:
forum
форум
foro
论坛
In the query format, specify an enumeration of characters from a to zzzz. This rotates the search output as much as possible and yields a multitude of new unique results:
$query {az:a:zzzz}
This macro will create 475254 additional queries for each original search query, which in total gives 4 x 475254 = 1901016 search queries. That is an impressive number, but it is not a problem for A-Parser at all: at a speed of 2000 requests per minute, this task will be processed in just 16 hours.
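The figures above can be checked with a short Python sketch. It is not part of A-Parser: the helper names `count_combinations` and `expand` are hypothetical, and the ordering of the generated suffixes (a, b, ..., z, aa, ...) is an assumption about how the macro enumerates.

```python
from itertools import product
from string import ascii_lowercase


def count_combinations(min_len, max_len, alphabet_size=26):
    """Count all strings over the alphabet with length in [min_len, max_len]."""
    return sum(alphabet_size ** n for n in range(min_len, max_len + 1))


def expand(query, max_len):
    """Yield 'query suffix' for every lowercase suffix of length 1..max_len.

    Mirrors the query format '$query {az:a:zzzz}' (assumed ordering).
    """
    for n in range(1, max_len + 1):
        for letters in product(ascii_lowercase, repeat=n):
            yield f"{query} {''.join(letters)}"


per_query = count_combinations(1, 4)   # 26 + 676 + 17576 + 456976
total = 4 * per_query                  # four original queries
hours = total / 2000 / 60              # at 2000 requests per minute

print(per_query, total, round(hours, 1))
```

The generator is lazy, so the full list of queries never has to be held in memory at once.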
Output Results Examples
A-Parser supports flexible formatting of results thanks to the built-in templating engine Template Toolkit, which allows it to output results in any form, as well as in a structured form, for example CSV or JSON.
Default Output
Result format:
$serp.format('$link\n')
Example result:
https://image.cnbcfm.com/api/v1/image/104930399-6ED2-REQ-010517-Dogecoin.jpg?v=1529477162
https://i0.wp.com/promoterdigital.com/wp-content/uploads/2017/12/Como-Comprar-Dogecoin.png?ssl=1
https://www.shdon.com/blog/images/2014/03/dogecoin-meme.png
https://www.thecoinrepublic.com/wp-content/uploads/2019/05/dogecoin.jpg
https://static.india.com/wp-content/uploads/2021/01/dogecoin.jpg
https://i0.wp.com/criptotendencia.com/wp-content/uploads/2020/07/Dogecoin-Vuelve-el-efecto-TikTok.jpg?fit=1200%2C800&ssl=1
https://www.cryptonewsz.com/wp-content/uploads/2019/01/Dogecoin-DOGE-Price-Analysis-Jan.11.jpg
https://www.mebere.com/sites/default/files/inline-images/Invest in Dogecoin.jpg
https://www.thecoinrepublic.com/wp-content/uploads/2020/04/Dogecoin-Doge-logo2.png
https://assets.coingecko.com/coins/images/7670/large/DogeCoin-Private-logo.png?1549240189
https://thelibertarianrepublic.com/wp-content/uploads/2019/05/Screen-Shot-2019-05-17-at-2.01.29-PM.jpg
https://i.teknolojioku.com/storage/files/images/2021/01/29/dogecoin-1-b1lB_cover.jpg
Output in CSV Table
Result format:
[% FOREACH item IN serp;
tools.CSVline(query, item.link, item.width, item.height, item.type, item.pagelink, item.thumb, item.anchor);
END %]
Example result:
dogecoin,https://interanalyst.us/wp-content/uploads/2018/01/dogecoin-logo.png,900,900,png,https://interanalyst.us/markets/cryptocurrency-trading-signals/dogecoin-doge/,https://tse1.mm.bing.net/th?id=OIP.MG983TODjELbLg7pfna_8AHaHa&pid=15.1,"Dogecoin | DOGE | Investment Charts with Algorithmic Trading Signals ..."
dogecoin,http://vkool.com/wp-content/uploads/2018/03/Dogecoin-1.jpg,1200,800,jpeg,https://vkool.com/dogecoin-doge-token-review/,https://tse1.mm.bing.net/th?id=OIP.Az4VhLecunYQo3ztj_waTgHaE8&pid=15.1,"Dogecoin (DOGE) Token Review (UPDATED 2018): What is It? | VKOOL"
dogecoin,https://www.cryptonewsz.com/wp-content/uploads/2019/02/Dogecoin.jpg,2048,1304,jpeg,https://www.cryptonewsz.com/price-analysis-of-dogecoin-doge/9113/,https://tse4.mm.bing.net/th?id=OIP.5B3vzuHUWNMGotGqJGWRCQHaEt&pid=15.1,"Price Analysis of Dogecoin (DOGE) - CryptoNewsZ"
dogecoin,https://digitalmoneytimes.com/wp-content/uploads/2018/09/dogecoin-petition.jpg,1000,750,jpeg,https://digitalmoneytimes.com/dogecoin-price-1-million-doge-nearly-equals-the-value-of-1-bitcoin/,https://tse3.mm.bing.net/th?id=OIP.-1QNz1XY_Ox2_MHmb2XqJAHaFj&pid=15.1,"Dogecoin Price: 1 Million Doge Nearly Equals the Value of 1 Bitcoin ..."
dogecoin,https://www.cryptonewsz.com/wp-content/uploads/2019/01/Dogecoin-DOGE-Price-Analysis-Jan.-22.jpg,1920,1200,jpeg,https://www.cryptonewsz.com/dogecoin-doge-price-analysis-jan-22/6798/,https://tse1.mm.bing.net/th?id=OIP.-CoV662aYK1YAz8SEo3KDwHaEo&pid=15.1,"Dogecoin (DOGE): Price Analysis, Jan. 22 - CryptoNewsZ"
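A result file in this layout can be read back with Python's standard `csv` module. The sketch below is only an illustration: the column names are taken from the order of arguments in the `tools.CSVline` call above, and the function name `read_results` is hypothetical.

```python
import csv
import io

# Column order as passed to tools.CSVline in the result format above.
FIELDS = ["query", "link", "width", "height", "type", "pagelink", "thumb", "anchor"]


def read_results(text):
    """Parse CSV result lines into dicts, converting the sizes to integers."""
    rows = []
    for row in csv.DictReader(io.StringIO(text), fieldnames=FIELDS):
        row["width"] = int(row["width"])
        row["height"] = int(row["height"])
        rows.append(row)
    return rows


sample = (
    "dogecoin,http://vkool.com/wp-content/uploads/2018/03/Dogecoin-1.jpg,"
    "1200,800,jpeg,https://vkool.com/dogecoin-doge-token-review/,"
    "https://tse1.mm.bing.net/th?id=OIP.Az4VhLecunYQo3ztj_waTgHaE8&pid=15.1,"
    '"Dogecoin (DOGE) Token Review (UPDATED 2018): What is It? | VKOOL"\n'
)
rows = read_results(sample)
print(rows[0]["link"], rows[0]["width"], rows[0]["height"])
```

For a real file, pass the file contents (or adapt the function to take an open file object) instead of the inline sample.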
Saving in SQL format
Result format:
[% FOREACH serp;
"INSERT INTO serp VALUES('" _ query _ "', '"; link _ "', '"; anchor _ "', '"; pagelink _ "', '"; thumb _ "')\n";
END %]
Example of result:
INSERT INTO serp VALUES('dogecoin', 'https://www.cryptonewsz.com/wp-content/uploads/2019/02/Dogecoin.jpg', 'Price Analysis of Dogecoin (DOGE) - CryptoNewsZ', 'https://www.cryptonewsz.com/price-analysis-of-dogecoin-doge/9113/', 'https://tse4.mm.bing.net/th?id=OIP.5B3vzuHUWNMGotGqJGWRCQHaEt&pid=15.1')
INSERT INTO serp VALUES('dogecoin', 'https://interanalyst.us/wp-content/uploads/2018/01/dogecoin-logo.png', 'Dogecoin | DOGE | Investment Charts with Algorithmic ...', 'https://interanalyst.us/markets/cryptocurrency-trading-signals/dogecoin-doge/', 'https://tse1.mm.bing.net/th?id=OIP.MG983TODjELbLg7pfna_8AHaHa&pid=15.1')
INSERT INTO serp VALUES('dogecoin', 'https://www.cryptonewsz.com/wp-content/uploads/2019/03/stack-of-dogecoins.jpg', 'Dogecoin (DOGE) Price Analysis : The Future of Dogecoin ...', 'https://www.cryptonewsz.com/dogecoin-doge-price-analysis-the-future-of-dogecoin-looks-very-bright/11091/', 'https://tse4.mm.bing.net/th?id=OIP.GLgXn-8nxIgTZNkUHpiQ3AHaGR&pid=15.1')
INSERT INTO serp VALUES('dogecoin', 'https://cdn.vox-cdn.com/thumbor/G_w4Nyo9IJx5q5xa5E92vJCVyUQ=/21x0:539x345/1200x800/filters:focal(21x0:539x345)/cdn.vox-cdn.com/assets/3727699/Dogecoin_logo.png', 'Bitcoin is so 2013: Dogecoin is the new cryptocurrency on ...', 'http://www.theverge.com/2013/12/16/5216862/bitcoin-is-so-2013-dogecoin-is-new-crypto-currency-on-the-block', 'https://tse1.mm.bing.net/th?id=OIP.ILO1wN5PD5jq3AOdG13itQHaE8&pid=15.1')
INSERT INTO serp VALUES('dogecoin', 'https://www.fxwrite.com/wp-content/uploads/2019/07/dogecoin.jpg', 'Dogecoin Price Showing Record 10.79% Uptrend Momentum', 'https://www.fxwrite.com/dogecoin-price-analysis-dogecoin-doge-is-showing-an-uptrend-momentum/', 'https://tse1.mm.bing.net/th?id=OIP.7ipqVaPp3hBRzmVPMmXYEQHaEK&pid=15.1')
INSERT INTO serp VALUES('dogecoin', 'https://www.cryptonewsz.com/wp-content/uploads/2019/01/Dogecoin-DOGE-Price-Analysis-Jan.-22.jpg', 'Dogecoin (DOGE): Price Analysis, Jan. 22 - CryptoNewsZ', 'https://www.cryptonewsz.com/dogecoin-doge-price-analysis-jan-22/6798/', 'https://tse1.mm.bing.net/th?id=OIP.-CoV662aYK1YAz8SEo3KDwHaEo&pid=15.1')
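As a rough illustration of how such statements can be consumed, the Python sketch below runs one of them against an in-memory SQLite database. The table definition is an assumption (five text columns in the order used by the template), and `sql_quote` is a hypothetical helper: the template above does not escape values, so an anchor containing a single quote would produce an invalid statement unless quotes are doubled first.

```python
import sqlite3


def sql_quote(value):
    """Quote a string as an SQL literal by doubling embedded single quotes."""
    return "'" + value.replace("'", "''") + "'"


conn = sqlite3.connect(":memory:")
# Assumed schema: one text column per value in the INSERT statement.
conn.execute(
    "CREATE TABLE serp (query TEXT, link TEXT, anchor TEXT, pagelink TEXT, thumb TEXT)"
)

statement = (
    "INSERT INTO serp VALUES('dogecoin', "
    "'https://www.fxwrite.com/wp-content/uploads/2019/07/dogecoin.jpg', "
    "'Dogecoin Price Showing Record 10.79% Uptrend Momentum', "
    "'https://www.fxwrite.com/dogecoin-price-analysis-dogecoin-doge-is-showing-an-uptrend-momentum/', "
    "'https://tse1.mm.bing.net/th?id=OIP.7ipqVaPp3hBRzmVPMmXYEQHaEK&pid=15.1')"
)
conn.execute(statement)

rows = conn.execute("SELECT query, anchor FROM serp").fetchall()
print(rows)
print(sql_quote("It's a cat"))
```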
Dump results to JSON
General result format:
[% IF notFirst;
",\n";
ELSE;
notFirst = 1;
END;
obj = {};
obj.query = query;
obj.images = [];
FOREACH item IN p1.serp;
obj.images.push({
link = item.link
width = item.width
height = item.height
type = item.type
pagelink = item.pagelink
thumb = item.thumb
anchor = item.anchor
});
END;
obj.json %]
Initial text:
[
Final text:
]
Example of result:
[{
"images": [
{
"link": "http://mymodernmet.com/wp/wp-content/uploads/2017/03/gabrielius-khiterer-stray-cats-8.jpg",
"width": "750",
"anchor": "Dignified Stray Cat Photos Celebrate Their Unique Beauty",
"pagelink": "https://mymodernmet.com/gabrielius-khiterer-stray-cat-photos/",
"type": "jpeg",
"thumb": "https://tse3.mm.bing.net/th?id=OIP.ZjfS8JQc9sahsK0-w8dRFAHaKJ&pid=15.1",
"height": "1028"
},
{
"link": "https://cms.qz.com/wp-content/uploads/2016/09/catscolonizedbyboat.jpg?quality=75&strip=all&w=1400",
"width": "1400",
"anchor": "DNA shows that cats colonized most of the planet by boat ...",
"pagelink": "https://qz.com/793979/dna-evidence-shows-cats-colonized-earth-by-boat/",
"type": "jpeg",
"thumb": "https://tse1.mm.bing.net/th?id=OIP.V6n6qGkcNx4JquiU6AvbBAHaEK&pid=15.1",
"height": "788"
},
{
"link": "https://hddesktopwallpapers.in/wp-content/uploads/2015/09/cute-baby-cats-wallpaper.jpg",
"width": "2560",
"anchor": "cute baby cats wallpaper - HD Desktop Wallpapers | 4k HD",
"pagelink": "https://hddesktopwallpapers.in/cute-baby-cats-wallpaper/",
"type": "jpeg",
"thumb": "https://tse4.mm.bing.net/th?id=OIP.QdEkrZjd1c_VN_aUtleoFgHaEo&pid=15.1",
"height": "1600"
}
],
"query": "cats"
}]
To make the "Initial text" and "Final text" options available in the Task Editor, you need to activate "More options".
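A JSON dump of this shape can be loaded with Python's standard `json` module. The sketch below is a minimal example under stated assumptions: `load_images` is a hypothetical helper, and the integer conversion reflects that `width` and `height` appear as strings in the example result.

```python
import json


def load_images(text):
    """Flatten the dump into one dict per image, tagged with its query."""
    flat = []
    for entry in json.loads(text):
        for image in entry["images"]:
            record = dict(image)
            record["query"] = entry["query"]
            # Sizes are strings in the dump; convert them for numeric use.
            record["width"] = int(image["width"])
            record["height"] = int(image["height"])
            flat.append(record)
    return flat


sample = """[{
  "images": [
    {
      "link": "http://mymodernmet.com/wp/wp-content/uploads/2017/03/gabrielius-khiterer-stray-cats-8.jpg",
      "width": "750",
      "anchor": "Dignified Stray Cat Photos Celebrate Their Unique Beauty",
      "pagelink": "https://mymodernmet.com/gabrielius-khiterer-stray-cat-photos/",
      "type": "jpeg",
      "thumb": "https://tse3.mm.bing.net/th?id=OIP.ZjfS8JQc9sahsK0-w8dRFAHaKJ&pid=15.1",
      "height": "1028"
    }
  ],
  "query": "cats"
}]"""

images = load_images(sample)
print(images[0]["query"], images[0]["width"], images[0]["height"])
```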
Possible settings
Parameter | Default value | Description |
---|---|---|
Pages count | 10 | Number of pages to scrape |
Safe search | Moderate | Safe search |
Region | Based on IP | Region |
Interface language | Any | Interface language |