OSINT
Last updated
Last updated
Summary: How crawlers work & how to google dork
Room:
crawler indexes websites, note the keywords, search for insite urls, then crawl that url recursively
crawlers first check for robots.txt
you might want to hide all .ini
file with /*.ini$
in UNIX system, hide all .conf
files
sitemap.xml
provides the websute structure, helps with SEO
-> google site analyzer, check speed performance
Summary: there are so many ways we can do to uncover the owner of a website & unveil connections between websites
Room:
-> check who owns the domain
-> check archived snapshot of website
-> anything about domain (history, etc)
Does a site feel like a legit source of info?
Language - What grade level is the writing? Does it seem to be written by a native English speaker?
UX - Is it user friendly? Is the design modern?
What pages does the site have?
-> SEO best practices
Search terms in the website's source code:
<!--
is HTML comments
@
for
ca-pub
is
ua-
is
.jpg
and other img exts. to reveal more directory structure
tools to check google codes: and
common link between heat.net & purchase.org: in 2011-2012, their IP shares the same hosting (liquid web, l.l.c); around the same time as when the link is placed (according to wayback machine)
heat.net is likely a PBN (private blog network) to purchase.org; its sole purpose is to make purchase.org rank higher in search engine results
this is why heat.net doesn't seem "natural" to the eyes
Summary: how to know location from image
Best reverse image search: Yandex > Bing > Google
Yandex uses AI, it tries to get what's really in the picture
Google finds exact match
TinEye looks for exact duplicate
increase the image resolution, 200x200 and below is no hope
try mirrorring, cropping, rotating the photo
rough estimate
get text, landmarks, road layouts
what is likely to be on the country/region
climate
popular brand of cars
driving side
etc
IP/ASN number
metadata/EXIF, social media geotagging
pinpointing
Room:
the photo subject can let the search engine focus on finding the background
building floor: