Goop | Google Search Scraper | Perform Google Searches Without Being Blocked Or Hitting Limits

image

Google is number one search engine, almost people on internet use google as their search engine, and google is can use to help pentester get some dummy informations. Because, you know Google Dork is one tricks to get informations with deep search, with dork you can open deepest side of a websites.

Introduction

goop can perform google searches without being blocked by the CAPTCHA or hitting any rate limits.

Many Pentesters use Google Dork to help find some important informations about target site or to find digital data from someone. If you dont know what is Google Dork ?

Google Dork is a queries/keywords on search engine to find deep informations. See Example

But, if you try to search a few keywords together in one time and hit a limit or your keyword is dangerous, google will block you and ask you for CAPTCHA.

How it works?

Facebook provides a debugger tool for its scraper. Interestingly, Google doesn’t limit the requests made by this debugger (whitelisted?) and hence it can be used to scrap the google search results without being blocked by the CAPTCHA.
Since facebook is involved, a facebook session Cookie must be supplied to the library with each request.

Usage

Installation

pip install goop

Example

from goop import goop   

page_1 = goop.search('red shoes', '<your facebook cookie>') 
page_2 = goop.search('red_shoes', '<your facebook cookie>', page='1') include_omitted_results = goop.search('red_shoes', '<your facebook cookie>', page='8', full=True)

The returned is a dict of following structure

{
  "0": {
     "url": "https://example.com",
     "text": "Example webpage",
     "summary": "This is an example webpage whose aim is to demonstrate the usage of ..."


  },

  "1": {
...

cli.py demonstrates the usage by performing google searches from the terminal with the following command

python cli.py

Legal & Disclaimer

Scraping google search results is illegal. This library is merely a proof of concept of the bypass. The author isn’t responsible for the actions of the end users.

GitHub - Goop

6 Likes

Thanks @sam I will give a try.

1 Like

It no longer works. Google team told the creator it’s not a legitimate issue when he reported it to them but now they just silently fixed it.

2 Likes