Proactively Responding to Cloudbleed with Splunk
What is Cloudbleed?
Cloudbleed is a serious flaw in the Cloudflare content delivery network (CDN) discovered by Google Project Zero security researcher Tavis Ormandy. Because of this vulnerability, Cloudflare leaked data stored in memory in response to specially crafted requests. The behavior is similar to Heartbleed, but Cloudbleed is considered worse because Cloudflare accelerates the performance of nearly 5.5 million websites globally. This vulnerability might have exposed sensitive information used to authenticate users, such as passwords, tokens, and cookies, to web crawlers used by search engines or to nefarious actors. In some cases, the exposed information included messages exchanged by users on a popular dating site.
Understanding the severity of Cloudbleed
A CDN primarily acts as a proxy between the user and the web server, caching content locally to reduce the number of requests made to the origin server. In this case, edge servers in the Cloudflare infrastructure were susceptible to a buffer overread vulnerability, exposing sensitive user information like authentication tokens. Technical details of the disclosure can be viewed on the Project Zero issue tracker.
Evaluating risk with Splunk
The most obvious risk introduced by this vulnerability is the exposure of user data, which may include the same credentials users rely on for corporate authentication. An easy way to gauge the scope of the problem is to compare the list of domains using Cloudflare DNS against your proxy or DNS logs. This gives you some insight into how often users visited the affected websites and the relative risk associated with reusing the same credentials across multiple accounts.
To do this analysis, first download the list of Cloudflare domains and modify the file so it can be used as a lookup:
$ git clone https://github.com/pirate/sites-using-cloudflare.git
Convert the text list to CSV by wrapping each domain in quotes and appending a second column:
$ sed -e 's/^/"/' -e 's/$/","true"/' sorted_unique_cf.txt > sorted_unique_cf_final.csv
Using a text editor, update the first line of the file. Change:
"","true"
to:
"domain","CloudflareDNS"
Finally, copy the formatted file to the lookups directory of the search app or a custom app used for security analysis:
$ cp sorted_unique_cf_final.csv /opt/splunk/etc/apps/security_viz/lookups/
After that step is complete, validate the lookup works:
| inputlookup sorted_unique_cf_final.csv
This might take some time because there are nearly 4.3 million domains in the lookup.
The domains on the list are not fully qualified domain names (FQDN), so they will be harder to match against proxy and IPS logs that include subdomains. Use URL Toolbox to parse the DNS queries or HTTP URLs in your IPS or proxy logs.
This is an example search of how to use URL Toolbox to parse Suricata DNS queries:
index=suricata event_type=dns
| lookup ut_parse_extended_lookup url AS query
In the example below, DNS queries are parsed and compared against the Cloudflare lookup. When a domain in the DNS query events matches a domain in the lookup, that event gets a new field called CloudflareDNS with a value of "true":
index=suricata event_type=dns
| lookup ut_parse_extended_lookup url AS query
| lookup sorted_unique_cf_final.csv domain AS ut_domain OUTPUT CloudflareDNS
Although the above search is helpful to identify whether a domain uses Cloudflare DNS, the next step is to use the new field to narrow the results to only DNS requests for Cloudflare domains:
index=suricata event_type=dns
| lookup ut_parse_extended_lookup url AS query
| lookup sorted_unique_cf_final.csv domain AS ut_domain OUTPUT CloudflareDNS
| search CloudflareDNS=true
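As a quick sanity check outside Splunk, the same overlap can be computed from the shell with comm, which prints the lines two sorted files have in common. This is a sketch, not part of the original workflow; the arguments are placeholders for a list of domains extracted from your DNS logs and the raw Cloudflare list:

```shell
cloudflare_overlap() {
  # $1: domains observed in your DNS logs, one per line
  # $2: the raw Cloudflare domain list (e.g. sorted_unique_cf.txt)
  # comm -12 suppresses lines unique to either file, leaving only the
  # domains that appear in both; comm requires sorted input, hence sort -u
  comm -12 <(sort -u "$1") <(sort -u "$2")
}
```

For example, `cloudflare_overlap queried_domains.txt sorted_unique_cf.txt` prints only the queried domains that appear on the Cloudflare list.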
In this environment, approximately 4.3 million domains are being checked across a nine-month period (the entire span of available data). This is important because random user data was leaked and the credentials could have been compromised at any time.
Because this search can take a long time, another useful technique is to save the results to a lookup file. This allows you to review and filter the results later without rerunning the search:
index=suricata event_type=dns
| lookup ut_parse_extended_lookup url AS query
| lookup sorted_unique_cf_final.csv domain AS ut_domain OUTPUT CloudflareDNS
| search CloudflareDNS=true
| stats count by src_ip ut_domain
| outputlookup all_affected_domains.csv
The final output from the search shows that, during the time period specified, nearly half a million visits were made to websites using Cloudflare. Breaking the results down by src_ip identifies the users who visited impacted sites most frequently. These users should change any reused credentials as a precaution.
Bonus: Use dig to determine the geography of impacted domains
Using a subset of data from the lookup, you can use sed and a script to dig for the IP address associated with each domain.
Reduce the results to only the domains:
| inputlookup all_affected_domains.csv
| dedup ut_domain
| rename ut_domain AS domain
| fields domain
| outputlookup cloud-bleed_domain-only.csv
Remove the quotes surrounding each domain in the file:
$ cat cloud-bleed_domain-only.csv | sed -e 's/"//g' > my_cloud-bleed_domains.csv
Run a script (domains.sh) that resolves each domain with dig and records the IP addresses:
$ bash domains.sh
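The contents of domains.sh are not shown here, so the helper below is a hypothetical reconstruction: it loops over the cleaned domain list, resolves each name with dig +short, and writes a lookup with the "IP Address" column the later searches expect. The "domain" column name is an assumption:

```shell
resolve_domains() {
  # $1: input file, one bare domain per line (my_cloud-bleed_domains.csv)
  # $2: output lookup to create (dig_cloud-bleed_domains.csv)
  local in="$1" out="$2" domain ip
  # header row; the "domain" column name is an assumption
  echo '"domain","IP Address"' > "$out"
  while IFS= read -r domain; do
    [ -n "$domain" ] || continue
    # dig +short prints one answer per line; a domain may resolve to
    # several A records, so emit one row per address
    for ip in $(dig +short "$domain" A); do
      # dig +short also prints intermediate CNAME targets, which
      # contain letters; keep only dotted-decimal addresses
      case "$ip" in *[!0-9.]*) continue ;; esac
      printf '"%s","%s"\n' "$domain" "$ip" >> "$out"
    done
  done < "$in"
}
```

Calling `resolve_domains my_cloud-bleed_domains.csv dig_cloud-bleed_domains.csv` produces the lookup used in the next step. Note that resolving millions of domains this way is slow; run it only against the reduced, deduplicated list.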
After running the script, you will have a new lookup called dig_cloud-bleed_domains.csv, which can be analyzed further in Splunk:
| inputlookup dig_cloud-bleed_domains.csv where "IP Address"=*
| rename "IP Address" AS ip_address
| iplocation ip_address
| stats count by Country
| geom geo_countries featureIdField=Country
| eval count="count: "+count
This choropleth map shows the geographical location of each affected domain visited by users in the environment. The majority of the traffic was to sites based in the United States, with Germany a distant second.
Related Articles
An Exercise in Threat Attribution: GRIZZLY STEPPE
A hands-on exercise using Splunk to evaluate the DHS and DNI GRIZZLY STEPPE indicators of compromise and assess whether they overlap with known Tor exit nodes.
Enhancing Enterprise Security for Ransomware
Step-by-step guide to integrating abuse.ch's ransomware intelligence feed into Splunk Enterprise Security for enhanced threat detection and response.
Analyzing Shadowbrokers Implants
Security analysis of the Shadow Brokers NSA tool leak and its impact on enterprise security, with Splunk-based detection strategies.