In today’s article, we discuss the background of threat intelligence feeds.
Cyber Threat Intelligence (CTI) is stream-based detection of cyber threats, including network anomaly indicators. But this is just the tip of the CTI iceberg, the one consumed by SIEM (Security Information and Event Management) appliances.
It is essential to understand the need to prepare or curate CTI.
At CERT-XLM, we actively refine our CTI feeds. This article discusses the importance of having usable feeds and how CERT-XLM achieves this.
What is a cyber threat intelligence feed?
Intelligence feeds contain indicators related to an identified or possible threat. CTI indicators are called IoCs (Indicators of Compromise) or IoAs (Indicators of Attack). An IoC is an indicator of a network security breach. IoCs are used to identify malware signatures, known IP addresses or domains, and exploitation of vulnerable products or versions, so that a compromise can be detected reactively. An IoA captures the intention of attackers as well as suspicious activities that could lead to attacker persistence or lateral movement.
Indicators can be IP addresses that deliver malware, IP addresses of an attacker’s control servers (hereafter referred to as “C2”), URLs of phishing web pages, names or hashes of malicious files, email addresses, and so on.
Where to get your own cyber threat intelligence feed?
The market for CTI sources is not consistent. Security solutions such as anti-virus, firewalls, or proxy solutions can include proprietary CTI.
VirusTotal [VT], Vx-underground [VX], and Abuse.ch [Abuse.ch] are open or public CTI sources from which a variety of samples or IoCs can be downloaded. These platforms are sometimes crowd-sourced, with URLs or files submitted by users worldwide.
Platforms such as PhishTank [PhishTank] or OpenPhish [OpenPhish] allow users to report malicious sites that seek to extract user credentials or other sensitive information.
You can create your own CTI platform, which gathers and stores feeds that are relevant to you; this is what CERT-XLM does.
Some feeds are dedicated to one threat, such as OpenPhish for phishing or the Abuse.ch feeds that each cover a particular family of malware. In contrast, others, such as the Malware Information Sharing Platform (MISP), offer a wide variety of known threats.
“Label all the things!”
The name of a threat is essential information for an analyst. There is no universal CTI naming convention, and each major vendor applies its own. For example, “Emotet” can be found under the name “Feodo,” “Heodo,” or “Geodo.” The stealer “Pony” may be labelled “Siplog” or “Fareit.” QakBot is also called “Pinkslipbot,” “QBot,” and “Quakbot.”
There are common categories for indicators like “C2”, “phishing”, “malware”, “RAT”, “ransomware”, “scanner”, and “botnet”. Unfortunately, the categories are not universal. The MISP community platform has the concept of “galaxies”, which each member can define. The result is multiple indicators associated with the same threat.
CERT-XLM uniformly labels all threats based on MISP classification standards. Our curated CTI is then applied consistently through SIEM Use Cases.
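As an illustration of uniform labelling, here is a minimal sketch that maps vendor-specific names to one canonical label. The alias table only contains the names quoted above, and both the choice of canonical names and the `normalize_label` helper are assumptions made for this example, not CERT-XLM's actual implementation.

```python
# Hypothetical alias table built from the names quoted in this article.
# The canonical name chosen for each family is an assumption.
ALIASES = {
    "emotet": {"feodo", "heodo", "geodo"},
    "pony": {"siplog", "fareit"},
    "qakbot": {"pinkslipbot", "qbot", "quakbot"},
}

# Invert the table once so that each lookup is a single dictionary access.
CANONICAL = {
    alias: name
    for name, aliases in ALIASES.items()
    for alias in aliases | {name}
}


def normalize_label(label: str) -> str:
    """Return the canonical threat name, or the cleaned label if unknown."""
    key = label.strip().lower()
    return CANONICAL.get(key, key)
```

Unknown labels are returned lowercased rather than rejected, so an analyst can review them and extend the table.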
Don’t lose control over your Cyber Threat Intelligence Feeds
Low-quality CTI feeds result in low-quality indicators.
Here are examples of low-quality CTI indicators caused by URL formatting issues:
- “http://${ip}:${port}/”
- “http://ttp://avorlen.xyz”
- “htttp://188.124.36.242:25802/”
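Formatting problems of this kind can be caught before ingestion. The sketch below is one possible check, using only the Python standard library; the function name `is_well_formed_url` and the rule that a host must contain at least one dot are assumptions for this example.

```python
import re
from urllib.parse import urlsplit

# Dotted host made of letters, digits and hyphens; IPv4 literals also match.
HOST_RE = re.compile(r"^(?:[a-z0-9-]+\.)+[a-z0-9-]{2,}$")


def is_well_formed_url(indicator: str) -> bool:
    """Reject URLs with an unknown scheme, template residue, or a bad host."""
    try:
        parts = urlsplit(indicator.strip())
        _ = parts.port  # raises ValueError when the port is not numeric
    except ValueError:
        return False
    if parts.scheme not in ("http", "https"):
        return False
    host = parts.hostname or ""
    return bool(HOST_RE.match(host))
```

With these rules, the three indicators above are rejected: the first for its non-numeric port placeholder, the second because its host parses as “ttp”, and the third for its misspelled scheme.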
Here is an example of a low-quality CTI indicator due to a content issue as it contains an RFC 1918 reserved IP address:
Here is an example of a low-quality CTI indicator based on being too specific. The indicator should not include the email address as this will vary based on the targeted user.
- “https://makeyouhappymg.ru.com/20?s3=ngo3g&s1=ppp11&email=valentino@acme.com”
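Both content issues can be handled programmatically. The following sketch assumes that victim-specific data travels in a query parameter named `email`; the helper names and that parameter list are hypothetical. Note that Python's `is_private` flag covers more than the RFC 1918 ranges (loopback and link-local addresses, for instance).

```python
import ipaddress
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def host_is_private(url: str) -> bool:
    """True when the URL host is an IP literal in a private or reserved range."""
    host = urlsplit(url).hostname or ""
    try:
        return ipaddress.ip_address(host).is_private
    except ValueError:
        return False  # the host is a domain name, not an IP literal


def drop_victim_params(url: str, victim_keys=("email",)) -> str:
    """Remove query parameters that identify the targeted user."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in victim_keys
    ]
    return urlunsplit(parts._replace(query=urlencode(kept)))
```

Applied to the indicator above, `drop_victim_params` keeps the `s3` and `s1` parameters and removes the email address.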
You have to sanitize CTI indicators. Sanitizing CTI content protects your team from accidentally detonating malware. Here is an example:
- http://%D0%B2%D0%BE%D0%B4%D0%B0.net/kE9_6iaxBF_WWLBR8Mxnu
- The domain is not valid as is. It percent-decodes to “вода”, a valid domain label, but it must be encoded in its internationalized form “xn--80adg3b” for a SIEM.
- https://viro(.)mleydier(.)en
- In this second example, the domain is defanged with parentheses, a common practice whose exact form is left to individual preference: one also finds square brackets, or “hxxp” in place of “http”.
CERT-XLM thoroughly tests, decodes, and re-encodes each indicator, cleans the recoverable ones, and discards the irrelevant ones.
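The decoding and re-encoding steps can be sketched as follows, using Python's built-in `idna` codec. The helper names `refang` and `to_ascii_host` are assumptions for this example, and the sketch deliberately ignores any user information in the URL.

```python
import re
from urllib.parse import unquote, urlsplit, urlunsplit


def refang(indicator: str) -> str:
    """Undo common defanging: a leading hxxp, and dots written [.] or (.)."""
    text = re.sub(r"^hxxp", "http", indicator.strip(), flags=re.IGNORECASE)
    return re.sub(r"[\[(]\.[\])]", ".", text)


def to_ascii_host(url: str) -> str:
    """Percent-decode the host and re-encode it in its IDNA (punycode) form."""
    parts = urlsplit(url)
    host = unquote(parts.hostname or "")
    # Raises UnicodeError when a label cannot be encoded.
    ascii_host = host.encode("idna").decode("ascii")
    netloc = ascii_host if parts.port is None else f"{ascii_host}:{parts.port}"
    return urlunsplit(parts._replace(netloc=netloc))
```

For the first indicator above, `to_ascii_host` yields a URL on the host `xn--80adg3b.net`, which a SIEM can match against DNS or proxy logs.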
Come on and grab your own!
Generic CTI content can generate a high volume of false positives.
Consider the four examples below:
- “http://firebasestorage.googleapis.com”
- This example is a well-known content hosting service. This service can be abused, but without the exact page qualification path, the domain itself is not malicious.
- “https://forms.office.com/r/”
- It is common for attackers to use free online services to host malicious documents. They then invite their victims to download and open them as part of a phishing campaign to avoid detection. Without the identifier (e.g., hxxps://forms.office.com/r/ne89GVwrYE), this is not an indicator of a threat.
- “172.67.138.4”
- is a shared public IP address (it falls outside the RFC 1918 range 172.16.0.0/12). It corresponds to countless domains. Not all of these domains are malicious, and the analyst doing reverse DNS research on it must be able to classify the alert.
- “http://p”
- is a less common example. This CTI content will generate a lot of alerts in most environments.
CERT-XLM takes extra precautions to avoid CTI content that generates a high volume of false positive alerts.
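A simple pre-filter can flag URL indicators that are too generic to be useful. This is only a sketch: the list of shared hosting services, the set of "empty" paths, and the `is_too_generic` helper are assumptions based on the examples above, and the function does not handle the shared IP address case, which needs passive DNS or hosting data.

```python
from urllib.parse import urlsplit

# Illustrative list only: the shared services quoted in the examples above.
SHARED_HOSTS = {"firebasestorage.googleapis.com", "forms.office.com"}
# Paths assumed to carry no resource identifier on those services.
EMPTY_PATHS = {"", "/", "/r", "/r/"}


def is_too_generic(url: str) -> bool:
    """Flag URLs that would match far more than the threat they describe."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    if "." not in host:
        return True  # e.g. a truncated host such as "p"
    if host in SHARED_HOSTS and parts.path in EMPTY_PATHS and not parts.query:
        return True  # shared service without the page that makes it malicious
    return False
```

Flagged indicators are better reviewed than silently dropped, since a fuller version of the same URL may exist in another feed.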
Adapting your cyber threat intelligence feed to your SIEM’s capabilities
It is important to have Secure Sockets Layer (SSL) decryption as part of your security architecture. Without adequate SSL decryption, your SIEM will have less useful information: only the visited domains will be recorded, not the full URLs.
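When only domains are visible to the SIEM, URL indicators have to be reduced to their host before ingestion. A minimal sketch, with a hypothetical helper name:

```python
from urllib.parse import urlsplit


def domains_from_urls(urls):
    """Return the sorted, de-duplicated hostnames found in URL indicators."""
    hosts = {urlsplit(url).hostname for url in urls}
    return sorted(host for host in hosts if host)
```

Keep in mind that reducing a URL to its domain makes the indicator broader, which raises the false-positive risk for shared hosting services.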
Your SIEM may also have software limitations. The first limitation could be the number of indicators it can ingest at one time. Other limitations will come from the functionalities of the SIEM, as none of them is perfect.
Unfortunately, some SIEMs support only exact matches. In that case, no alert will be raised for a visit to the URL https://walletsdappvalidation.com/ when the indicator is “https://walletsdappvalidation.com/?u”.
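The difference between exact matching and a more tolerant comparison can be illustrated with a short sketch. The relaxed strategy shown here (comparing host and path only, ignoring the query string and a trailing slash) is one possible choice and an assumption of this example, not a description of any particular SIEM.

```python
from urllib.parse import urlsplit


def exact_match(visited: str, indicator: str) -> bool:
    """What an exact-match-only SIEM effectively does."""
    return visited == indicator


def host_path_key(url: str) -> tuple:
    """Reduce a URL to (host, path) without query string or trailing slash."""
    parts = urlsplit(url)
    return (parts.hostname or "", parts.path.rstrip("/"))


def relaxed_match(visited: str, indicator: str) -> bool:
    """Compare on host and path only."""
    return host_path_key(visited) == host_path_key(indicator)
```

If the SIEM cannot relax its matching, the same normalization can be applied to the indicators before ingestion, at the cost of broader matches.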
Test, test and test again
It is essential to validate your CTI content.
CERT-XLM measures CTI content in an isolated environment to reduce the risk of incidents caused by establishing connections to the indicators.
We then evaluate:
- How redundant are the indicators with the feeds we already have? This includes measuring how many indicators match what is already blocked by the firewalls, proxies, anti-virus products, and other perimeter security solutions in your environment.
- What proportion of these indicators will generate false positives? This can be measured offline by looking for the indicators in an extract of your traffic.
- How much reprocessing is required before ingestion by the SIEM? If there is no SSL decryption of your traffic, at a minimum, you will need to extract domains from URLs. You will also need strategies to counter false positives, as described above. Reprocessing also involves cleaning up the recoverable indicators, as in the sanitization examples above.
- What is the proportion of waste? This means counting the unusable indicators, such as the malformed ones shown earlier, plus those that generate false positives.
Only these quantifiable answers will allow you to conclude whether the feed is worthwhile.
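Three of these questions reduce to set arithmetic once the indicators are normalized. The sketch below assumes every input is a collection of already-normalized indicator strings; the function name, parameter names, and ratio definitions are assumptions for this example, and the reprocessing effort is left out because it is not a simple ratio.

```python
def evaluate_feed(candidate, existing, blocked, benign_traffic, unusable):
    """Return ratios in [0, 1] describing a candidate feed.

    candidate      -- indicators from the feed under evaluation
    existing       -- indicators already present in current feeds
    blocked        -- indicators already blocked by perimeter solutions
    benign_traffic -- indicators seen in an extract of known-good traffic
    unusable       -- indicators rejected by format or content checks
    """
    candidate = set(candidate)
    total = len(candidate)
    if total == 0:
        return {"redundancy": 0.0, "false_positive": 0.0, "waste": 0.0}
    redundant = candidate & (set(existing) | set(blocked))
    false_positive = candidate & set(benign_traffic)
    # Waste is everything unusable plus everything that triggers false positives.
    waste = (candidate & set(unusable)) | false_positive
    return {
        "redundancy": len(redundant) / total,
        "false_positive": len(false_positive) / total,
        "waste": len(waste) / total,
    }
```

Tracking these ratios over time also shows whether a feed that was once worthwhile is degrading.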
CERT-XLM also recommends the use of a passive DNS database. This approach allows extended searches to be performed without consuming SIEM resources. If matches are identified, a search over a short period in the SIEM, using the full indicator, will determine whether each match is a false positive or a true positive.
Cyber threat intelligence feeds: what to conclude?
There are a lot of details to keep in mind when integrating CTI content with a SIEM. This is why CERT-XLM recommends applying a structured approach to curating CTI content.