Changes between Version 5 and Version 6 of SyncingInstances


Ignore:
Timestamp:
02/18/10 17:45:23 (4 years ago)
Author:
rgrp
Comment:

add new sections at the top

Legend:

Unmodified
Added
Removed
Modified
  • SyncingInstances

    v5 v6  
    1 = Syncing = 
     1= Distributed Data and Syncing Between CKAN Instances = 
     2 
     3Aim: '''to support pulling and pushing metadata between different CKAN registries while ''preserving'' history.''' 
     4 
     5This problem has strong similarities to distributed version control and distributed databases. 
     6 
     7= Research = 
     8 
     9  * Distributed version control systems: 
     10    * Mercurial -- for an overview [http://mercurial.selenic.com/wiki/Mercurial?action=AttachFile&do=get&target=Hague2009.pdf Inside a Distributed VCS] 
     11  * Distributed databases - NB: we are looking to preserve history (not just allow multiple distributed writes) 
     12 
     13= Use cases = 
     14 
     15== 1. data.gov.uk and ckan.net == 
     16 
     17Want to make data on data.gov.uk (hmg.ckan.net) available in a public CKAN instance. We will therefore end up with: 
     18 
     19  1. Package on data.gov.uk 
     20  2. Package on ckan.net 
     21 
     22Need to keep these two representations of the package in "sync". 
     23 
     24Remarks: This is easy if we only edit in one place.  But what if we want to let community edit on ckan.net? Two options: 
     25 
     26  1. have 2 copies on ckan.net one community owned and one locked down 
     27    * Pro: easy to keep stuff separate 
     28    * Con: terrible user experience and still have issue that two items can diverge 
     29  2. Have one copy that is world editable into which gets *merged back* into the official data every so often 
     30 
     31 
     32= Detailed Problem Description = 
     33 
     34Let us consider different CKAN registries: A,B,C 
     35 
     36Consider the following way we may want information to travel:: 
     37 
     38{{{ 
     39  A ------- 
     40           \ 
     41            C 
     42  A --> B--/ 
     43}}} 
     44 
     45In words: 
     46  1. Changes go directly from A to C 
     47  2. Changes go directly from A to B. Then changes from B are pulled to C 
     48 
     49= Specification v1 = 
    250 
    351== Scenarios == 
     
    1462 
    1563 * Use of Server A and Server B continues undisturbed during sync. 
    16  
    1764 
    1865== Issues == 
     
    100147 * Object method 'merge_diffs'. It takes an original object and two diffs that apply to it and applies them both in a new revision. 
    101148 
    102 == Related == 
    103  
    104  * About Mercurial http://mercurial.selenic.com/wiki/Mercurial?action=AttachFile&do=get&target=Hague2009.pdf