<?xml version="1.0"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0">
  <channel>
    <title>CKAN: Ticket #49: Filter Spam in Changes to CKAN Data</title>
    <link>http://localhost/ticket/49</link>
    <description>&lt;h1 id="AsA"&gt;As A&lt;/h1&gt;
&lt;p&gt;
sysadmin
&lt;/p&gt;
&lt;h1 id="IWantTo"&gt;I Want To&lt;/h1&gt;
&lt;p&gt;
Have revisions to the CKAN data filtered in order to reduce the spam in the system.
&lt;/p&gt;
&lt;h1 id="Details"&gt;Details&lt;/h1&gt;
&lt;p&gt;
In the long run this is a quite a generic problem common across several OKF systems and probably can become a general component in the okfmisc repo. For time being focus on a well-factored CKAN-specific solution.
&lt;/p&gt;
&lt;p&gt;
Suggest we follow path of trac: &lt;a class="ext-link" href="http://trac.edgewall.org/wiki/SpamFilter"&gt;&lt;span class="icon"&gt;​&lt;/span&gt;http://trac.edgewall.org/wiki/SpamFilter&lt;/a&gt;
&lt;/p&gt;
&lt;p&gt;
Could have a general engine that aggregates spam scores from many different 'plugins' and then marks spam appropriately (actions should be configurable depending on spam level from 'purge' to 'delete' (mark revision as inactive) to 'flag' to 'do nothing').
&lt;/p&gt;
&lt;p&gt;
Main initial plugins would be:
&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;regex filter (this would seem very useful here, e.g. do not allow urls in commit messages ...)
&lt;ul&gt;&lt;li&gt;could augment using the badcontent list approach (can find list on e.g. moinmoin)
&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li&gt;spambayes and/or akismet
&lt;/li&gt;&lt;/ul&gt;</description>
    <language>en-us</language>
    <image>
      <title>CKAN</title>
      <url>http://assets.okfn.org/p/ckan/img/ckan_logo_shortname.png</url>
      <link>http://localhost/ticket/49</link>
    </image>
    <generator>Trac 0.12.3</generator>
    <item>
      
        <dc:creator>rgrp</dc:creator>

      <pubDate>Mon, 23 Jun 2008 18:06:08 GMT</pubDate>
      <title>priority changed; milestone deleted</title>
      <link>http://localhost/ticket/49#comment:1</link>
      <guid isPermaLink="false">http://localhost/ticket/49#comment:1</guid>
      <description>
          &lt;ul&gt;
            &lt;li&gt;&lt;strong&gt;priority&lt;/strong&gt;
                changed from &lt;em&gt;blocker&lt;/em&gt; to &lt;em&gt;minor&lt;/em&gt;
            &lt;/li&gt;
            &lt;li&gt;&lt;strong&gt;milestone&lt;/strong&gt;
                &lt;em&gt;v0.6&lt;/em&gt; deleted
            &lt;/li&gt;
          &lt;/ul&gt;
        &lt;p&gt;
Has become less of a problem since big efforts a few months ago so this can be downgraded for time being.
&lt;/p&gt;
      </description>
      <category>Ticket</category>
    </item><item>
      
        <dc:creator>rgrp</dc:creator>

      <pubDate>Tue, 03 Nov 2009 10:42:53 GMT</pubDate>
      <title>status changed; resolution set</title>
      <link>http://localhost/ticket/49#comment:2</link>
      <guid isPermaLink="false">http://localhost/ticket/49#comment:2</guid>
      <description>
          &lt;ul&gt;
            &lt;li&gt;&lt;strong&gt;status&lt;/strong&gt;
                changed from &lt;em&gt;new&lt;/em&gt; to &lt;em&gt;closed&lt;/em&gt;
            &lt;/li&gt;
            &lt;li&gt;&lt;strong&gt;resolution&lt;/strong&gt;
                set to &lt;em&gt;invalid&lt;/em&gt;
            &lt;/li&gt;
          &lt;/ul&gt;
        &lt;p&gt;
Not a problem at the moment.
&lt;/p&gt;
      </description>
      <category>Ticket</category>
    </item>
 </channel>
</rss>