OSINT
Google Dorking
Summary: How crawlers work & how to google dork
Room: https://tryhackme.com/r/room/googledorking
crawler indexes websites, note the keywords, search for insite urls, then crawl that url recursively
crawlers first check for
robots.txtyou might want to hide all
.inifile with/*.ini$in UNIX system, hide all
.conffilessitemap.xmlprovides the websute structure, helps with SEOhttps://pagespeed.web.dev/ -> google site analyzer, check speed performance
Web OSINT
Summary: there are so many ways we can do to uncover the owner of a website & unveil connections between websites
Room: https://tryhackme.com/r/room/webosint
https://www.namecheap.com/ -> check who owns the domain
Wayback machine -> check archived snapshot of website
ViewDNS.info -> anything about domain (history, etc)
Does a site feel like a legit source of info?
Language - What grade level is the writing? Does it seem to be written by a native English speaker?
UX - Is it user friendly? Is the design modern?
What pages does the site have?
https://ahrefs.com/blog/seo-best-practices/ -> SEO best practices
Search terms in the website's source code:
<!--is HTML commentsca-pubis Google Publisher IDua-is Google AdSense ID.jpgand other img exts. to reveal more directory structure
tools to check google codes: https://www.nerdydata.com/ and https://spyonweb.com/
common link between heat.net & purchase.org: in 2011-2012, their IP shares the same hosting (liquid web, l.l.c); around the same time as when the link is placed (according to wayback machine)
heat.net is likely a PBN (private blog network) to purchase.org; its sole purpose is to make purchase.org rank higher in search engine results
this is why heat.net doesn't seem "natural" to the eyes
Geolocating Images
Summary: how to know location from image
Room: https://tryhackme.com/r/room/geolocatingimages
Reverse Image Search
Best reverse image search: Yandex > Bing > Google
Yandex uses AI, it tries to get what's really in the picture
Google finds exact match
TinEye looks for exact duplicate
increase the image resolution, 200x200 and below is no hope
try mirrorring, cropping, rotating the photo
blurring out the photo subject can let the search engine focus on finding the background
Geolocating
rough estimate
get text, landmarks, road layouts
what is likely to be on the country/region
climate
popular brand of cars
driving side
etc
IP/ASN number
metadata/EXIF, social media geotagging
pinpointing
building floor: eye level and perspective
Last updated