Recon Series : URL Enumeration (Part 2)

This is part 2 of a 3 part series explaining reconnaissance (recon) for bug bounty hunting. In part 2, we explained different recon methodologies.

recon reconnaissance bugbounty infosec

Tuhin Bose

October 31st 2023.

Recon Series : Part 2

In the previous blog, we discussed how to perform subdomain enumeration and enumerate services on these subdomains. In this blog, we'll see the next part of performing successful recon.

Directory Bruteforcing

Directory bruteforcing is the act of guessing valid directories/files in the target web server. Sometimes it's possible to get some interesting pages (such as admin panel, backup/log files, env file, unauthenticated dashboard, etc) through directory bruteforcing. For performing directory bruteforcing, we need a list of interesting endpoints i.e. wordlist. We can use assetnote or trickest wordlists.

We're going to use ffuf for performing directory bruteforce.

1ffuf -c -w files.txt -u https://target.com/FUZZ
2

Fetching URLs from Public Archives

Directory bruteforcing is pretty helpful to find out interesting endpoints in the target server. However, it's a part of active recon and causes lots of noise in the web server which can trigger the WAF and block us.

An alternate way of directory bruteforcing would be fetching URLs from public archives such as Wayback Machine, AlienVault's Open Threat Exchange, Common Crawl, etc. These archives crawl the web and store the endpoints which can be viewed later.

In this section, we'll talk about 3 different tools that will fetch URLs from public archives and give us the output.

i. WayBackURLs

Waybackurls is a command-line tool for fetching URLs from the Wayback Machine (archive of websites that contains over 858 billion web pages). Sometimes, you can find some juicy information from waybackurls such as passwords, PII data, API keys, etc. We can either use the command-line tool https://github.com/tomnomnom/waybackurls or directly browse the URL: http://web.archive.org/cdx/search/cdx?url=*.target.com/*&collapse=urlkey&output=text&fl=original Since the number of URLs is quite big, we can search for the below keywords to extract sensitive information:

1password
2secret
3token
4access
5pwd
6api
7.json
8=http [For SSRF/open redirection]
9=%2F [For SSRF/open redirection]
10=/ [For SSRF/open redirection]
11email=
12@
13ey [For JWT tokens]
14.txt
15aws
16admin
17.js
18config
19dashboard
20oauth
21sql
22

These are some common keywords you can search for to extract sensitive data from waybackurls. During your testing, if you encounter some endpoint that reveals juicy info about your account (for example invoices, billing info, etc), you can search for the endpoint in waybackurl to check if you're able to access other users' data. For instance, if https://target.com/orders/1aq0b2chy9qar3w84chfr7ju5poa6 shows the order info of your order, you can search for the keyword target.com/orders/ to check if other similar endpoints are logged in Wayback machine.

ii. Gau

Gau is a similar tool that fetches known URLs from AlienVault's Open Threat Exchange, the Wayback Machine, Common Crawl, and URLScan.

1printf target.com | gau 
2

You can search for the same keywords after gathering the URLs. Also, there are multiple one-liners which will make your work easier. For example, if you want to get the JS files from gau and look for sensitive info inside it, you can use httpx and nuclei for this:

1echo target.com | gau | grep ".js" | httpx -content-type | grep 'application/javascript' | awk '{print $1}' | nuclei -t nuclei-templates/http/exposures/
2

You can find more such one-liners here

iii. Waymore

As per the official repository, the difference between Waymore and other tools is that it can also download the archived responses for URLs on wayback machine so that you can then search these for even more links, developer comments, extra parameters, etc.

1python3 waymore.py -i bugbase.in -mode U
2

Parameter Bruteforcing

Sometimes, it's possible to get some hidden parameters in some endpoint which may lead to further attacks such as cross-site scripting, SSRF, open redirection, etc. Generally, it's recommended to look for hidden parameters in pages where there are some kind of forms. For example, during one of my pen tests, I used arjun for parameter bruteforcing on a signup page. It gave me a parameter referral which was vulnerable to XSS.

1arjun -u https://bugbase.in/register -m GET,POST
2

Additionally, ffuf or param-miner can also be used for parameter bruteforcing.

Dorking

Dorking involves using advanced search queries to find sensitive or hidden information on web applications. In this section, we'll discuss about Google Dorking and GitHub Dorking.

i. Google Dorking

Google Dorking is the art of using advanced search queries in Google to search for specific keywords, file types, or parameters. Sometimes, developers expose sensitive endpoints (admin panel, log files, etc) to the internet which are later crawled and indexed by Google. We can use advanced search queries from GHDB or Bug Bounty Helper to find such sensitive pages.

ii. GitHub Recon

Many times, developers of the company accidentally push sensitive information into the GitHub repository. We can try to find out this information using GitHub dorks. Here are some keywords that we can search for:

1PASSWORD
2PWD
3KEY
4API
5TOKEN
6ACCESS_TOKEN
7SECRETKEY
8CLIENT-SECRET
9CLIENT_SECRET
10SECRET
11@target.com
12DEV
13PROD
14JENKINS
15CONFIG
16SSH
17FTP
18MYSQL
19ADMIN
20AWS
21DASHBOARD
22BUCKET
23ST NO
24CVV
25GITHUB_TOKEN
26=http
27OTP
28OAUTH
29AUTHORIZATION
30LDAP
31INTERNAL
32language:sql
33language:json
34language:txt
35

All we need to do is search for these keywords in GitHub after "target.com". For example, to look for passwords, we can search "target.com" password or "target.com" pwd in GitHub. We can also use GitDorker to automate the entire process.

P.S: Make sure to verify the credentials found from GitHub before reporting.

Aquatone

Aquatone is a tool for the visual inspection of websites across a large number of hosts and is convenient for quickly gaining an overview of the HTTP-based attack surface.
Let's say, you have a list of 1000 subdomains. Of course, you can't go through each of them one by one. Here comes Aquatone handy. It will take the list of subdomains, scan for popular ports that are often used by HTTP services, and take a screenshot of the resolved webpage. Additionally, it will generate a report that will allow us to view the targets in a list or in a graph view. We can look at which pages are identical and which aren’t. We can explore the headers returned for each endpoint and much more stuff.

1cat http.txt | aquatone
2

Nuclei

Finally, we're going to use nuclei for scanning vulnerabilities on all the subdomains.

1nuclei -l http.txt -t nuclei-templates/ -severity critical,high,medium,low
2

Recon Series : Part 2

Related Blogs

All About JWT Vulnerabilities

In this blog, we'll discuss JSON Web Tokens, the structure of JWT, and various vulnerabilities associated with JWTs

JWT Vulnerability Cyber Security Account Takeover

Published by Tuhin Bose on September 27th 2023

Exploiting post message vulnerabilities for Fun and Profit

In this blog, we talk about how to prevent, exploit and find post message vulnerabilities for cross origin communication

postMessage Same Origin Policy Cross Origin Communication XSS DOM-Based XSS Account Takeover Information Disclosure Bypass

Published by Bhavarth Karmarkar on October 13th 2023

Recon Series : Domain Enumeration (Part 1)

This is part 1 of a 3 part series explaining reconnaissance (recon) for bug bounty hunting. In part 1, I explained diffe...

recon reconnaissance bugbounty infosec

Published by Devang Solanki on October 18th 2023

Let's take your security
to the next level

Recon Series : URL Enumeration (Part 2)

Recon Series : Part 2

Directory Bruteforcing

Fetching URLs from Public Archives

i. WayBackURLs

ii. Gau

iii. Waymore

Parameter Bruteforcing

Dorking

i. Google Dorking

ii. GitHub Recon

Aquatone

Nuclei

Table of Contents

Let's take your security
to the next level

About

Quick Links

PRIVACY

SOCIAL

Copyright © 2024 BugBase All Rights Reserved

Recon Series : URL Enumeration (Part 2)

Recon Series : Part 2

Directory Bruteforcing

Fetching URLs from Public Archives

i. WayBackURLs

ii. Gau

iii. Waymore

Parameter Bruteforcing

Dorking

i. Google Dorking

ii. GitHub Recon

Aquatone

Nuclei

Table of Contents

Let's take your security to the next level

About

Quick Links

PRIVACY

SOCIAL

Copyright © 2024 BugBase All Rights Reserved

Let's take your security
to the next level