<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Avira - TechBlog &#187; URL-Blocker</title>
	<atom:link href="http://techblog.avira.com/tag/url-blocker/en/feed/en/" rel="self" type="application/rss+xml" />
	<link>http://techblog.avira.com</link>
	<description></description>
	<lastBuildDate>Thu, 19 Nov 2009 06:38:23 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Spam through Sourceforge.net (Update)</title>
		<link>http://techblog.avira.com/2009/06/29/spam-through-sourceforgenet/en/</link>
		<comments>http://techblog.avira.com/2009/06/29/spam-through-sourceforgenet/en/#comments</comments>
		<pubDate>Mon, 29 Jun 2009 08:37:17 +0000</pubDate>
		<dc:creator>Dirk Knop</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Spam]]></category>
		<category><![CDATA[URL-Blocker]]></category>

		<guid isPermaLink="false">http://techblog.avira.com/?p=961</guid>
		<description><![CDATA[Today happened what I thought it was impossible: I received spam on my username’s alias email address registered at sourceforge.net.
Sourceforge.net is the world&#8217;s largest open source software development web site. I have an account there since I was a student and started to work as volunteer for an open source project. I still do, even [...]]]></description>
			<content:encoded><![CDATA[<p>Today happened what I thought it was impossible: I received spam on my username’s alias email address registered at sourceforge.net.</p>
<div id="attachment_962" class="wp-caption alignnone" style="width: 310px"><a href="http://techblog.avira.com/wp-content/uploads/2009/06/sfnet-spam.png"><img class="size-medium wp-image-962" title="sfnet-spam" src="http://techblog.avira.com/wp-content/uploads/2009/06/sfnet-spam-300x212.png" alt="Fig. 1: A simple but effective spam" width="300" height="212" /></a><p class="wp-caption-text">Fig. 1: A simple but effective spam</p></div>
<p><a title="SourceForge.net" href="http://sourceforge.net/" target="_blank">Sourceforge.net</a> is the world&#8217;s largest open source software development web site. I have an account there since I was a student and started to work as volunteer for an open source project. I still do, even if with not the same intensity as before. Sourceforge is known for its very aggressive anti spam measures. The Spamassasin software at sourceforge.net has detected correctly the email as spam, but why didn’t it stop it for being delivered?</p>
<div id="attachment_963" class="wp-caption alignnone" style="width: 310px"><a href="http://techblog.avira.com/wp-content/uploads/2009/06/sa-headers.png"><img class="size-medium wp-image-963" title="sa-headers" src="http://techblog.avira.com/wp-content/uploads/2009/06/sa-headers-300x211.png" alt="Fig. 2: The mail is correctly flagged as spam." width="300" height="211" /></a><p class="wp-caption-text">Fig. 2: The mail is correctly flagged as spam.</p></div>
<p>The spam mail I&#8217;ve seen this morning consists just of one line of text. The only thing which allowed an anti spam filter to detect the message as spam was the fact the link inside was blacklisted because of hosting a spam website and that the IP address from the Received headers was already blacklisted.</p>
<p>So, everything is ok, but why did I receive the email even if it was flagged correctly? The website <a title="Sourceforge.net user mail aliases" href="http://sourceforge.net/apps/trac/sourceforge/wiki/User%20mail%20aliases " target="_blank">does say something</a> about the email aliases  that simply receive whatever comes there: “Any email sent to a user&#8217;s mail alias is automatically passed to the email address that is on file for a that account.”</p>
<p>Well, this is very nice &#8211; but very wrong:  To test this, I’ve sent an email from my email account at  work (domain is avira.com), but it was immediately whitelisted because of the many security features that our admins support (DKIM, Signatures, reverse DNS, etc.). So, it went through the filter.</p>
<p>I’ve sent another email from another email address having attached the well know GTUBE test file. Now everything was different, the email was blocked and I received immediately a nice email making fun of me:</p>
<div id="attachment_965" class="wp-caption alignnone" style="width: 310px"><a href="http://techblog.avira.com/wp-content/uploads/2009/06/sf-nospam1.png"><img class="size-medium wp-image-965" title="sf-nospam1" src="http://techblog.avira.com/wp-content/uploads/2009/06/sf-nospam1-300x73.png" alt="Fig. 3: Spam that is not automatically forwarded." width="300" height="73" /></a><p class="wp-caption-text">Fig. 3: Spam that is not automatically forwarded.</p></div>
<p>So, why all this happened if Sourceforge doesn’t automatically forward any email sent to the users’ aliases? I don&#8217;t know, but I will surely ask Sourceforge. I will blog again if I receive the answer from them. Oh, by the way, Avira Premium Security Suite also correctly marks this kind of email as spam.</p>
<p><strong>Update:</strong></p>
<p>After writing to the Sourceforge Support an email, I received the answer below in less than an hour. I must say that I was pleasantly surprised for such a fast response time, considering the fact that Sourceforge gives all these services for free to the programmers.</p>
<p>&#8220;At SourceForge, we do our best to prevent spam from reaching our users.  However, it isn&#8217;t possible to prevent all spam from getting through, and you will occasionally see examples like the one you&#8217;ve provided.  We are constantly updating our filters and anti-spam techniques, though, so you should see this problem resolve itself in the next day or so.  If it persists, please let us know.</p>
<p>An additional step you can take is to filter based on the &#8220;X-VA-Spam-Flag: YES&#8221; header, which we apply to email we suspected of being spam.  Finally, we recently added the ability to control what sorts of email you receive through your email alias; you can find this feature on your Account Options page.&#8221;</p>
<p style="text-align: right;"><a href="mailto:sorin.mustaca@avira.com" target="_blank">Sorin Mustaca</a><br />
Manager International Software Development</p>
]]></content:encoded>
			<wfw:commentRss>http://techblog.avira.com/2009/06/29/spam-through-sourceforgenet/en/feed/en/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Providing protection against malware and phishing URLs</title>
		<link>http://techblog.avira.com/2008/11/11/providing-protection-against-malware-and-phishing-urls/en/</link>
		<comments>http://techblog.avira.com/2008/11/11/providing-protection-against-malware-and-phishing-urls/en/#comments</comments>
		<pubDate>Tue, 11 Nov 2008 08:32:32 +0000</pubDate>
		<dc:creator>Dirk Knop</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Phishing]]></category>
		<category><![CDATA[URL-Blocker]]></category>
		<category><![CDATA[WebGuard]]></category>

		<guid isPermaLink="false">http://techblog.avira.com/?p=201</guid>
		<description><![CDATA[Phishing, spam and malware have a couple of things in common: they have become a major problem for the users, for the banks and for online businesses. They are delivered either as attachments or via URLs contained in the emails. The AV industry is trying to protect its customers as good as it can by [...]]]></description>
			<content:encoded><![CDATA[<p>Phishing, spam and malware have a couple of things in common: they have become a major problem for the users, for the banks and for online businesses. They are delivered either as attachments or via URLs contained in the emails. The AV industry is trying to protect its customers as good as it can by gathering and analysing the emails with dangerous attachments and by blocking the URLs to phishing and malware websites.</p>
<p>Because the emails are so well crafted, sometimes it is not possible to mark them as SPAM, thus reaching users&#8217; inboxes. Some of these spam emails are spreading malware. Not only malware is nowadays a threat for the users but also phishing emails and websites which sell faked products which can be potentially dangerous as well (pharmaceutilcals).</p>
<p>The only solution to block access to the malware is to block the target URL in a generic way, without knowing for sure from the beginning the reason for which it is blocked. Such a powerful and dynamic system needs a very good control and monitoring center in order to be maintainable.</p>
<p>Avira developed a system in order to manage from a single point the malware and phishing URLs gathered from multiple sources, track the URLs in order to see that they are taken down, generate statics for detecting outbreaks and generate information to prevent companies when they are targeted by some phishing attacks.</p>
<div id="attachment_203" class="wp-caption alignnone" style="width: 310px"><a href="http://techblog.avira.com/wp-content/uploads/2008/11/architecture-urlcheck1.jpg"><img class="size-medium wp-image-203" title="architecture-urlcheck1" src="http://techblog.avira.com/wp-content/uploads/2008/11/architecture-urlcheck1-300x176.jpg" alt="Fig. 1: Architecture" width="300" height="176" /></a><p class="wp-caption-text">Fig. 1: Architecture</p></div>
<p>The system is created having in mind that we can add at any time a new source of URLs.(represented by the gray source with a „?“)</p>
<div id="attachment_205" class="wp-caption alignnone" style="width: 310px"><a href="http://techblog.avira.com/wp-content/uploads/2008/11/most-used-categories.jpg"><img class="size-medium wp-image-205" title="most-used-categories" src="http://techblog.avira.com/wp-content/uploads/2008/11/most-used-categories-300x87.jpg" alt="Fig. 2: Categories of URLs" width="300" height="87" /></a><p class="wp-caption-text">Fig. 2: Categories of URLs</p></div>
<p>As we can see, most of the URLs we block are pointing to malware and only about a quarter are pointing to phishing websites. These URLs are used to create updates for several web filtering products of Avira like Webguard, a module of the „Avira Premium Security Suite“ product.</p>
<p><strong>Features</strong></p>
<p>One of the most important features of the system is the ability to find the registrar which is hosting the phishing or the malware page. Once we find the registrar, we can find its location and create a world map of the sites which host malware and phishing.</p>
<div id="attachment_206" class="wp-caption alignnone" style="width: 310px"><a href="http://techblog.avira.com/wp-content/uploads/2008/11/worldmap.jpg"><img class="size-medium wp-image-206" title="worldmap" src="http://techblog.avira.com/wp-content/uploads/2008/11/worldmap-300x184.jpg" alt="Fig. 3: World distribution of malware and phishing" width="300" height="184" /></a><p class="wp-caption-text">Fig. 3: World distribution of malware and phishing</p></div>
<p>As we can see in the Figure, most of the threats are hosted in U.S.A., followed by Europe. Another interesting statistic generated by the system is the top of the most attacked brands and the top of the providers which host most of the files.</p>
<div id="attachment_207" class="wp-caption alignnone" style="width: 310px"><a href="http://techblog.avira.com/wp-content/uploads/2008/11/attacked_brands.jpg"><img class="size-medium wp-image-207" title="attacked_brands" src="http://techblog.avira.com/wp-content/uploads/2008/11/attacked_brands-300x299.jpg" alt="Fig. 4: Attacked brands (from September 2008)" width="300" height="299" /></a><p class="wp-caption-text">Fig. 4: Attacked brands (from September 2008)</p></div>
<p>On the first place in the top of the most attacked brands is eBay with 3277 unique phishing websites. On the second place is PayPal with 2606 websites and on the third place, very close to American Express with 2464 websites.</p>
<div id="attachment_208" class="wp-caption alignnone" style="width: 310px"><a href="http://techblog.avira.com/wp-content/uploads/2008/11/threatvariation.jpg"><img class="size-medium wp-image-208" title="threatvariation" src="http://techblog.avira.com/wp-content/uploads/2008/11/threatvariation-300x184.jpg" alt="Fig. 5: Number of threats" width="300" height="184" /></a><p class="wp-caption-text">Fig. 5: Number of threats</p></div>
<p><strong>Challenges</strong></p>
<p>Since end of September 2008 when the system was started, we encountered many challenges while creating this system. The challenges were caused by the differences between the sources we used: the URLs detected by our own Antiphishing product, Phishtank, LCheck (an internal system dealing only with Malware URLs) and Clean-MX ( a system that deals with both phishing and malware URLs). The only thing these sources have in common is the fact that they have an URL which should be blocked. Other challenges we faced are the errors and special situations these services produced: invalid data, lack of availability and false positives.</p>
<p>The system started to record about 100 new URLs at the beginning, which was not a great challenge for our hardware. The situation completely changed when we had to deal with almost 1000 unique URLs per day. These unique URLs are gathered from more than 20000 URLs which have to be verified and sorted. The server has to deal with these special situations and must also check the validity of the URLs by downloading each file in order to analyse and scan it.</p>
<p>A real challenge was removing non relevant URLs like those pointing to no longer existing websites and malware files. Usually, when a web resource is no longer available, a webserver is returning a special error (404). In order to become more user friendly, many websites are no longer returning this error but redirect to a special webpage informing the visitor that the requested resource is no longer there. Since the websites are very often hosted in non English speaking countries, it is not really a solution to parse the webpage and look for some known content.</p>
<div id="attachment_209" class="wp-caption alignnone" style="width: 310px">
<table border="1">
<tbody>
<tr>
<td><a href="http://techblog.avira.com/wp-content/uploads/2008/11/google.jpg"><img class="alignnone size-medium wp-image-209" title="google" src="http://techblog.avira.com/wp-content/uploads/2008/11/google-300x55.jpg" alt="" width="300" height="55" /></a></td>
<td><a href="http://techblog.avira.com/wp-content/uploads/2008/11/110mb.jpg"><img class="alignnone size-medium wp-image-210" title="110mb" src="http://techblog.avira.com/wp-content/uploads/2008/11/110mb-300x139.jpg" alt="" width="300" height="139" /></a></td>
</tr>
</tbody>
</table>
<p><p class="wp-caption-text">Fig. 6: Answers provided by various websites</p></div></p>
<p>Fortunately, by analysing some of these websites, we figured out that they use some common “keywords” and “key sentences” explaining what is happening. Many of these are international words. We filter about 60% of the pages with this empiric technique.</p>
<p>More details about various techniques for reaching the real content of a page are explained in the article „Delivering reliable phishing protection“, published in Virus Bulletin Magazine, May 2008.</p>
<p style="text-align: right;">Sorin Mustaca<br />
Manager International Software Development</p>
]]></content:encoded>
			<wfw:commentRss>http://techblog.avira.com/2008/11/11/providing-protection-against-malware-and-phishing-urls/en/feed/en/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
