Current ThreatQ Version Filter
 

Indicator URL Normalization

Remove Quotes from the Beginning and/or End of an Indicator

Single and double quote characters are removed if they are the first or last character of an indicator.

Remove Unneeded Spaces found within an Indicator

All spaces irrelevant of their position in the Indicator value are removed (when applicable).

Adjust Leading Protocol from Indicators

Indicators with a leading protocol [http://, https://, ftp://, or ftps://] are extracted and included as an attribute. When applicable, this indicator adjustment could change the indicator type from URL to FQDN.

Original URL indicator of http://evilsubdomain.no-ip.biz/ would convert to a FQDN = evildomain.no-ip.biz.

Adjust the Port from an IP Address

An IP address with a port [ex. 199.7.136.88:8143] will be truncated to the IP address and the port assignment will be added as an attribute.

Using the previous example the following indicator/attribute will be created:

Field Value
URL 199.7.136.88
Attribute > Port 8143

Adjust Defanged/Neutered Indicators

Indicators that have been defanged/neutered in order to “safely” share them (i.e. www [dot] 3322 [dot] org or badguy [at] gmail.com) need to be adjusted during import in order to ensure the indicators are properly deployed.

Create an IP Address from a URL (when applicable)

Using the previous example the following indicators will be created:

Field Value
URL 51.255.131.66/civis/viewforum.php
IP Address 51.255.131.66

Create a FQDN from a URL (when applicable)

When a URL contains a domain [ex. bat99-11611.co/gate777.php] a second indicator will be created for the domain [bat99-11611.co].

Using the previous example, the following indicators will be created:

Field Value
URL bat99-11611.co/gate777.php
FQDN bat99-11611.co

Query Strings in a URL Indicator

URL query strings are important but can significantly limit pattern-matching detection capabilities due to the likelihood of deviations, as well as, hamper the volume of URL indicators being deployed.

As such, the normalization process discards query strings.  

Maintain “WWW” on FQDN Indicators

When parsing or importing an FQDN the “www” will be maintained.

Replace and/or Remove Special Characters

Character Replacement

ASCII Values < 32
ASCII Values > 127

<space>
Ascii 96 -
Ascii145 '
Ascii146 '
Ascii147 "
Ascii148 "
Ascii151 -
carriage return and line feed <space>
Control Characters Remove
Convert to UTF8 Remove leading and trailing space, tab, newline, carriage return, vertical tabs and null characters.