<?xml version="1.0"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0">
  <channel>
    <title>CKAN: Ticket #888: Improvements to the dataproxy and the data API</title>
    <link>http://localhost/ticket/888</link>
    <description>&lt;p&gt;
First version of dataproxy and data API working (&lt;a class="closed ticket" href="http://localhost/ticket/698" title="task: CKAN Data API v1 (closed: fixed)"&gt;ticket:698&lt;/a&gt;) but have identified a variety of important improvements. (Should split these into sub-tickets ...):
&lt;/p&gt;
&lt;p&gt;
For dataproxy:
&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Testing for dataproxy
&lt;ul&gt;&lt;li&gt;Can start by using known good remote urls (moving forward could switch to providing/mocking these locally)
&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li&gt;Remove content-lenght for csv requirement: just read the first x rows (up to some configurable maximum)
&lt;/li&gt;&lt;li&gt;Google docs style row/column selections
&lt;/li&gt;&lt;li&gt;Use the swiss library - &lt;a class="ext-link" href="https://bitbucket.org/okfn/swiss"&gt;&lt;span class="icon"&gt;​&lt;/span&gt;https://bitbucket.org/okfn/swiss&lt;/a&gt;
&lt;ul&gt;&lt;li&gt;Support google docs spreadsheets (format = service/gdocs/ccc or gdocs/ccc or gdocs/spreadsheet)
&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li&gt;Handle redirects for content-length?
&lt;/li&gt;&lt;li&gt;Ignore resource type if not recognized and fall-back to trying to identify from extension (or mime-type?)
&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;
For dataapi:
&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Ensure we pass on resource format as part of redirect i.e. /api/data/{id} -&amp;gt; {dataproxy}?url={resource-url}&amp;amp;type={resource-type}
&lt;/li&gt;&lt;/ul&gt;</description>
    <language>en-us</language>
    <image>
      <title>CKAN</title>
      <url>http://assets.okfn.org/p/ckan/img/ckan_logo_shortname.png</url>
      <link>http://localhost/ticket/888</link>
    </image>
    <generator>Trac 0.12.3</generator>
    <item>
      
        <dc:creator>Stiivi</dc:creator>

      <pubDate>Wed, 12 Jan 2011 11:04:57 GMT</pubDate>
      <title></title>
      <link>http://localhost/ticket/888#comment:1</link>
      <guid isPermaLink="false">http://localhost/ticket/888#comment:1</guid>
      <description>
        &lt;p&gt;
Chages to Data Proxy:
&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;tests added with configurable list of known URLs
&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;use brewery for transformations (included reference to brewery framework in a new vendor directory)
&lt;/li&gt;&lt;li&gt;side effect: code to make google find external packages in vendor directory (from now on, all external packages should go there and be referenced from .hgsub if they are mercurial repositories)
&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;changed response contents: moved from 'headers' to root, renamed 'response' to 'data', added field list as 'fields'
&lt;/li&gt;&lt;li&gt;changed way of registering transformers (now class object is used instead of name)
&lt;/li&gt;&lt;li&gt;added 'encoding' and 'dialect' parameters for CSV
&lt;/li&gt;&lt;li&gt;added optional data audit (parameter 'audit')
&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;
Changes: &lt;a class="ext-link" href="https://bitbucket.org/Stiivi/dataproxy/changeset/fccbdd275be5"&gt;&lt;span class="icon"&gt;​&lt;/span&gt;https://bitbucket.org/Stiivi/dataproxy/changeset/fccbdd275be5&lt;/a&gt;
&lt;/p&gt;
&lt;p&gt;
Data information: &lt;a class="ext-link" href="http://databrewery.org/doc/data_quality.html#brewery.dq.FieldStatistics"&gt;&lt;span class="icon"&gt;​&lt;/span&gt;http://databrewery.org/doc/data_quality.html#brewery.dq.FieldStatistics&lt;/a&gt;
&lt;/p&gt;
      </description>
      <category>Ticket</category>
    </item><item>
      
        <dc:creator>thejimmyg</dc:creator>

      <pubDate>Mon, 06 Jun 2011 09:28:57 GMT</pubDate>
      <title>owner, status changed; repo, theme set</title>
      <link>http://localhost/ticket/888#comment:2</link>
      <guid isPermaLink="false">http://localhost/ticket/888#comment:2</guid>
      <description>
          &lt;ul&gt;
            &lt;li&gt;&lt;strong&gt;owner&lt;/strong&gt;
              changed from &lt;em&gt;Stiivi&lt;/em&gt; to &lt;em&gt;thejimmyg&lt;/em&gt;
            &lt;/li&gt;
            &lt;li&gt;&lt;strong&gt;repo&lt;/strong&gt;
                set to &lt;em&gt;ckan&lt;/em&gt;
            &lt;/li&gt;
            &lt;li&gt;&lt;strong&gt;theme&lt;/strong&gt;
                set to &lt;em&gt;none&lt;/em&gt;
            &lt;/li&gt;
            &lt;li&gt;&lt;strong&gt;status&lt;/strong&gt;
                changed from &lt;em&gt;new&lt;/em&gt; to &lt;em&gt;assigned&lt;/em&gt;
            &lt;/li&gt;
          &lt;/ul&gt;
        &lt;p&gt;
I don't think any progress has been made on this for a bit so I'm assigning it to me.
&lt;/p&gt;
      </description>
      <category>Ticket</category>
    </item><item>
      
        <dc:creator>shevski</dc:creator>

      <pubDate>Fri, 08 Jul 2011 13:56:22 GMT</pubDate>
      <title>owner, milestone changed</title>
      <link>http://localhost/ticket/888#comment:3</link>
      <guid isPermaLink="false">http://localhost/ticket/888#comment:3</guid>
      <description>
          &lt;ul&gt;
            &lt;li&gt;&lt;strong&gt;owner&lt;/strong&gt;
              changed from &lt;em&gt;thejimmyg&lt;/em&gt; to &lt;em&gt;johnglover&lt;/em&gt;
            &lt;/li&gt;
            &lt;li&gt;&lt;strong&gt;milestone&lt;/strong&gt;
                changed from &lt;em&gt;ckan-v1.5&lt;/em&gt; to &lt;em&gt;ckan-current-sprint&lt;/em&gt;
            &lt;/li&gt;
          &lt;/ul&gt;
      </description>
      <category>Ticket</category>
    </item><item>
      
        <dc:creator>johnglover</dc:creator>

      <pubDate>Wed, 27 Jul 2011 13:25:03 GMT</pubDate>
      <title>status changed; resolution set</title>
      <link>http://localhost/ticket/888#comment:4</link>
      <guid isPermaLink="false">http://localhost/ticket/888#comment:4</guid>
      <description>
          &lt;ul&gt;
            &lt;li&gt;&lt;strong&gt;status&lt;/strong&gt;
                changed from &lt;em&gt;assigned&lt;/em&gt; to &lt;em&gt;closed&lt;/em&gt;
            &lt;/li&gt;
            &lt;li&gt;&lt;strong&gt;resolution&lt;/strong&gt;
                set to &lt;em&gt;fixed&lt;/em&gt;
            &lt;/li&gt;
          &lt;/ul&gt;
        &lt;p&gt;
Dataproxy / Dataapi now deprecated in favour of the combination of new QA archive / process commands and the webstore.
&lt;/p&gt;
&lt;p&gt;
Changes in relation to Dataproxy / Dataapi:
&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Currently only supports CSV files, but plans to add support for excel and google docs spreadsheets soon.
&lt;/li&gt;&lt;li&gt;Uses David Raznick's CSV parser instead of Brewery for parsing, handles messy CSV data better.
&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;
Changes in relation to old QA functionality:
&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;decoupled archiving (downloading) and QA process
&lt;/li&gt;&lt;li&gt;added a new 'process' command which parses downloaded files and adds them to a local webstore
&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;
Closing for now, any improvements/feature requests should be in tickets relating to either the QA functionality or the webstore.
&lt;/p&gt;
      </description>
      <category>Ticket</category>
    </item>
 </channel>
</rss>