webscan

Webscan tries to gather as much information from domains, IPs, and URLs as possible from an external perspective. It covers

DNS configuration
Domain and Nameserver ownerships
IPv4 and IPv6 availability
IP address ownerships
Blacklisting status
Open ports
TLS validity
TLS configuration safety
HTTP/HTTPS configuration with redirects
host-headers
cookies
html, js, css sizes
subdomains
and much more!

of a specified url or ip and gives improvement recommendations based on best-practices.

Installation

If you're feeling fancy:

curl -s https://raw.githubusercontent.com/thetillhoff/webscan/main/install.sh | sh

or manually from https://github.com/thetillhoff/webscan/v3/releases/latest.

Usage

webscan google.com # Scan domain and website
webscan 192.168.0.1 # Scan IP and website
webscan https://github.com/thetillhoff/webscan # Scan website at default port according to schema
webscan http://example.com:8080 # Scan website at specific port

webscan --help # Learn more about running specific scans

Features

DNS

Display dns information about the provided URL, and give improvement recommendations.

DNS mail security

Subdomain finder

IPv6 readiness

Check if both ipv4 and ipv6 entries were found.
- IPv4 is necessary to stay backwards compatible.
- IPv6 is recommended to be IPv6 ready.

IP analysis

Check who is the hoster of the IP address via RDAP (successor of whois) - like AWS, Azure, GCP, ...
Check if any IP (v4 and v6) of the domain is blacklisted.
- IPv4
- IPv6

Open ports

Check all found ipv4 and ipv6 entries for relevant open ports. Examples for relevant ports are SSH, FTP, SMB, SMTP, HTTP, POSTGRES
- Check whether FTP is disabled completely (only use SFTP or FTPS)
- Check whether SSH has password auth disabled and uses a secure configuration
Check ports in parallel, since the connection timeout is set to 2s, which can add up quite much.
Check if open ports match across all IPs.
If http detection feature is enabled, check HTTP and HTTPS ports even if this feature is not enabled.

SSL/TLS check

HTTP detection

By default webscan assumes you're using https. Yet, it will check whether it's available via http as well.

Optionally follow HTTP redirects (status codes 30x)
If http is available, it should be used for redirects only.
If https is availabe, it should either redirect or respond with 200 status code.
If both http and https are available
- and https redirects, check that either http redirects to https or to the same location that https redirects to
- and both are redirecting, the destination should be a https location
- and https does not redirect, http should redirect to it with 301 or 308 status code. (https://kubernetes.github.io/ingress-nginx/user-guide/nginx-configuration/configmap/#http-redirect-code)
Check which http versions are supported by the webserver (like HTTP/1.1, HTTP/2, HTTP/3 aka QUIC)

HTTP headers

Analyze host-headers of the response and recommend best-practices.
Check HTTP Strict Transport Security (HSTS / STS). This defeats attacks such as SSL Stripping, and also avoids the round-trip cost of the 301 redirect to Redirect HTTP to HTTPS.
CSP header settings (have one is the minimum requirement here)
- use nonce or hash for script
- use self, or at least https, warn for everything else -> https://storage.googleapis.com/pub-tools-public-publication-data/pdf/45542.pdf
Scan cookies
- amount
- length
- used characters

HTML content

Print recommendations on the html code.

SEO recommendations

check if robots.txt exists
check if sitemap exists
check if incompatible plugins like flash are used

Open todos or feature ideas

Bugfixes

WRN Couldn't check ip blacklisting because of error code: ip="46.62.145.190", response="[127.255.255.254]" What happened: ip blacklisting returns error code. Expected: try to parse the error code and print helpful text.
Subdomain scan should print helpful text in case of common errors, like too many requests etc. instead of displaying the error code.
List of subdomains should be filtered to show only subdomains, not all domains listed in the certificates.
HTTP header scan results contain Recommended action for Strict-Transport-Security: max-age value should be increased in stages from 15552000 to 63072000 (two years), which doens't make sense for http. Expected: don't print this for http.
find solution for crt.sh error - don't show it or whatever
`--subdomains triggers subdomainscan but not tls check, even though that's required for checking SANs of cert
HTTP content size is 5kB even though it just redirects to https? Is there a follow-redirect set?

Feature ideas

find better way to pass results of previous scans to the next scan. Ideas:
- target contains results, so it's not necessary to pass them as arguments to each scan. Advantage: Simple, easy to maintain. Risk: Circular dependencies.
- results as shared package, so they can be read by all scans. Advantage: Strong typing, easy to maintain. Risk: all packages depend on this, even if it's not used.
- use simple variables, which are then passed to the next scan. Advantage: Simple, works everywhere. Risk: complex, hard to maintain.
- use result variables in webscan package only, and use them to pass the "simple" result variables around. Advantage: Strong typing, easy to maintain. Risk: Is this possible?
Instead of fixed texts for recommendations and the likes, use types/enums/constants/functions for them. Make them comparable to each other, so f.e. http and https recommendations are comparable.
add reverse ip lookup to subdomain scan with resolver.LookupAddr.
add reverse dns lookup for ip addresses.

Search for TODO in code
Check README of pkg/status for todos
Check README of pkg/logger for todos
Check github issues https://github.com/thetillhoff/webscan/v3/issues
Ensure github actions build always has the correct version as output of webscan version
- add buildargs to example usage sections in all three repos for the actions
portscan on ipv4 and ipv6 might result in consistency-warning if your local machine only supports one of them! But sometimes it works anyway 🤷 add a note to the output after diving deeper
HTTP header scan results and HTTPS header scan results could be merged into one if they are equal. Also, if they redirect, they should not be displayed (don't follow redirects by default).
HTTP content scan results and HTTPS content scan results could be merged into one if they are equal. Also, if they redirect, they should not be displayed (don't follow redirects by default).
subdomainscan should check body of all responses in cache of httpClient for referenced subdomains
use ruleset approach for http/https and forwarding evaluation
add repo url to help text, maybe even Issues link
add support to read $1 arg from stdin
add support to structure output as json with --json-output
subdomainScan should list in alphabetical order, and include *. as dedicated host, but maybe italic or dimmed
Check readme of thetillhoff.de for insights (accessibility, other features, plus caddyfile, ...)
Check TTL for dns records and html caching
urls with or without ending slash / filename & extension input-path and redirect locations should either end with filename.ext or /. netlight.com is "wrong" while netlight.com/ or netlight.com/index.html are correct. The reason is that the part after the last slash might be tried to parse as filename by some applications. This is only a recommendation though.
Check favicons (https://css-tricks.com/favicons-how-to-make-sure-browsers-only-download-the-svg-version/, https://evilmartians.com/chronicles/how-to-favicon-in-2021-six-files-that-fit-most-needs)
Add webscan status or additional functionality to webscan version that checks if a new version is available (status could be used to check if internet connectivity is available, plus maybe scanning the local machine with it's public IP)
RE: timeout timing: as soon as the first response came, wait for one more second, then stop waiting and continue

httpHeaderScan results should be shown as list of items formatted like

- <Problem statement like "No CSP header set">;
  <Recommendation like "To incease the security of the website, implement a CSP header.">
  More information: <link>

Also, key-value pairs contains in CSP etc should be in separate lines each for better readability.

tlsScan results should be shown as list of items formatted like

- <Problem statement like "TLS ciphers using RC4 were prohibited to use by the IETF in 2015">;
  <Recommendation like "Remove the affected ciphers from your TLS terminator.">
  More information: <link>
  Affected ciphers:
  - <cipher name like "TLS_ECDHE_ECDSA_WITH_RC4_128_SHA">

tlsScan could compare against known configurations like aws-tls-configs with different versions. Then the recommendation can be more specific ("Are you using AWS? Find out how to change the available ciphers at <link>")
try to identify CMS system by checking known urls like /wp-includes/...
If a request fails (timeout, etc), skip it and the resulting scan. Example: netlight.com with https://netlight.com/wp-includes/css/dist/block-library/style.min.css\?ver\=5.3.2 Print warning in such a case
think about whether, and if yes, where to add https://ssl-config.mozilla.org/
add unit tests
- add tests for redirecting output to a file or pipe it and then write it into file
  - test stdout result
  - test stderr result
    - should include logs and error messages
Print IP RDAP info in pretty mode, depending on longest ip address
HTTP scan should check for latency, hops, download speed
HTML content scan should depend on content type of response; for example it should verify if it's valid json for application/json
list all domains that are referenced (like fonts.google.com, ...)
Outside of webscan: Add buildargs to example usage sections in all three repos for the gh-actions In other words: Ensure github actions build always has the correct version as output of webscan version Or: Add buildargs to example usage sections in all three repos for the actions
check if both ipv4 and ipv6 mx records exist (follow cnames on mx records automatically)
add functional tests with expected results on example website (github-pages?)
add check of version in tcp greeting / header message. openssh tells the client about it's version there.
check FTP headers
dns checks and dials time out on windows
add integration test to release pipeline, which runs webscan with a few sample runs on multiple platforms
add estimated location to ips and ASN info
check if quic uses udp on port 443, and incorporate it into scans
add post-quantum cipher verification to tls scan
create a new docs structure, as this one is getting too long

Name		Name	Last commit message	Last commit date
Latest commit History 272 Commits
.github/workflows		.github/workflows
.vscode		.vscode
pkg		pkg
.gitignore		.gitignore
.markdownlint.yaml		.markdownlint.yaml
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
DEVELOPMENT.md		DEVELOPMENT.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum
install.sh		install.sh
main.go		main.go
renovate.json		renovate.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

webscan

Installation

Usage

Features

DNS

DNS mail security

Subdomain finder

IPv6 readiness

IP analysis

Open ports

SSL/TLS check

HTTP detection

HTTP headers

HTML content

SEO recommendations

Open todos or feature ideas

Bugfixes

Feature ideas

About

Uh oh!

Releases 31

Uh oh!

Contributors 3

Uh oh!

Languages

License

thetillhoff/webscan

Folders and files

Latest commit

History

Repository files navigation

webscan

Installation

Usage

Features

DNS

DNS mail security

Subdomain finder

IPv6 readiness

IP analysis

Open ports

SSL/TLS check

HTTP detection

HTTP headers

HTML content

SEO recommendations

Open todos or feature ideas

Bugfixes

Feature ideas

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 31

Uh oh!

Contributors 3

Uh oh!

Languages