1HᗩᑕK's 2 BIG AD Space for rent! Click here for more details.

 $20 for x1 ad slot with over a Million page views per month! Only x7 slots are available - for booking contact below.

@TheJoker or email at [email protected]

UDdup | Urls De-Duplication Tool For Better Recon

image

The tool gets a list of URLs, and removes “duplicate” pages in the sense of URL patterns that are probably repetitive and points to the same web template.

For example:

https://www.example.com/product/123https://www.example.com/product/456https://www.example.com/product/123?is_prod=falsehttps://www.example.com/product/222?is_debug=true

All the above are probably points to the same product “template”. Therefore it should be enough to scan only some of these URLs by our various scanners.

The result of the above after UDdup should be:

https://www.example.com/product/123?is_prod=falsehttps://www.example.com/product/222?is_debug=true

Why do I need it?

Mostly for better (automated) reconnaissance process, with less noise (for both the tester and the target).

Examples

Take a look at demo.txt which is the raw URLs file which results in demo-results.txt.

Installation

With pip (Recommended)

pip install uddup

Manual (from code)

# Clone the repository.git clone https://github.com/rotemreiss/uddup.git# Install the Python requirements.cd udduppip install -r requirements.txt

Usage

uddup -u demo.txt -o ./demo-result.txt

More Usage Options

uddup -h

Short Form Long Form Description
-h –help Show this help message and exit
-u –urls File with a list of urls
-o –output Save results to a file
-s –silent Print only the result URLs
-fp –filter-path Filter paths by a given Regex

Filter Paths by Regex

Allows filtering custom paths pattern. For example, if we would like to filter all paths that starts with /product we will need to run:

# Single Regexuddup -u demo.txt -fp "^product"

Input:

https://www.example.com/https://www.example.com/privacy-policyhttps://www.example.com/product/1https://www.example2.com/product/2https://www.example3.com/product/4

Output:

https://www.example.com/https://www.example.com/privacy-policy

Advanced Regex with multiple path filters

uddup -u demo.txt -fp "(^product)|(^category)"

Contributing

Feel free to fork the repository and submit pull-requests.

Support

Create new GitHub issue

Want to say thanks? Message me on Linkedin

GitHub:

2 Likes
Friendly Websites

https://igg-games.com/ https://pcgamestorrents.com/ https://pirateiro.com/ ettvdl.com https://dodi-repacks.site/ https://crackingpatching.com/ https://glodls.to/ https://prostylex.org/ https://haxnode.com/ https://freedownloadae.com/ https://www.novahax.com/ https://www.sadeempc.com/ freecoursesonline.me ftuapps.dev