Ticket #1467 (closed defect: worksforme)
CKAN dumps for DGU are missing certain publisher information
Reported by: | thejimmyg | Owned by: | thejimmyg |
---|---|---|---|
Priority: | major | Milestone: | ckan-sprint-2012-01-09 |
Component: | ckan | Keywords: | |
Cc: | | Repository: | ckan |
Theme: | none | | |
Description
Pawel knows about this so David Read, Pawel and I need to find time to discuss it.
Change History
comment:1 Changed 2 years ago by dread
- Owner changed from dread to jimmyg
- Status changed from new to assigned
comment:2 Changed 2 years ago by kindly
- Milestone changed from ckan-sprint-2011-11-21 to current-ckan-sprint-2011-12-05
comment:4 Changed 2 years ago by rgrp
- Milestone changed from ckan-sprint-2011-12-05 to current-ckan-sprint-2012-01-09
Moving to current sprint as this sprint is now long finished.
@jimmyg: please close, defer, update as necessary!
comment:5 Changed 2 years ago by thejimmyg
- Status changed from assigned to closed
- Resolution set to worksforme
The publisher issue seems to be resolved now, although during investigation I also found these issues:
- 9 of the records have no "published by" value, and I wondered why
- Lots of them are state=deleted (so do we really want to include these?)
- We're still showing the deprecated agency field
- Many of the departments are blank
Pawel is not available to work on these anyway at the moment, so let's pick them up as part of the disintegration work to migrate to CKAN. Marking the main ticket as "worksforme" since it does now.
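For anyone picking these up later, the four checks listed above could be counted with something like the following. This is only a rough sketch under assumptions: it assumes the dump is a JSON array of dataset dicts, that each dict has a `state` key and an `extras` dict, and that the fields are named `published_by`, `agency` and `department`. None of those names are confirmed in this ticket, so adjust them to whatever the real dump contains.

```python
import json
from collections import Counter


def load_dump(path):
    """Load a dump file, assumed here to be a single JSON array of dataset dicts."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def audit_dump(records):
    """Count how many records show each of the issues noted in this comment.

    The key names ("extras", "published_by", "agency", "department") are
    assumptions for illustration, not confirmed field names.
    """
    counts = Counter()
    for record in records:
        extras = record.get("extras") or {}
        if not extras.get("published_by"):
            counts["missing_published_by"] += 1
        if record.get("state") == "deleted":
            counts["deleted"] += 1
        if "agency" in extras:
            counts["has_agency_field"] += 1
        if not str(extras.get("department") or "").strip():
            counts["blank_department"] += 1
    return counts
```

Calling `audit_dump(load_dump("dump.json"))` (the file name is hypothetical) would return a `Counter` with one entry per issue, which makes it easy to see whether the numbers change after a fix.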
comment:6 Changed 2 years ago by dread
I believe that the "publisher issue" that James alludes to is that the dump doesn't contain the 'parent publisher' field that is generated in the DGU system on the Drupal side. This information will be stored following the Groups Refactor #1477 and should be added to the dump at that point.
Excluding datasets that are state=deleted is a good idea. I've split that off into #1623.
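To illustrate what that exclusion amounts to, here is a minimal sketch. It assumes each dumped dataset is a dict carrying a `state` key whose value is `"deleted"` for deleted datasets; it is not the actual dumper code, and the function name is made up for this example.

```python
def exclude_deleted(records):
    """Return only the datasets whose state is not 'deleted'.

    Records with no "state" key are kept, on the assumption that a
    missing state should not be treated as deleted.
    """
    return [record for record in records if record.get("state") != "deleted"]
```

Filtering before writing the dump, rather than after, keeps deleted datasets out of every output format in one place.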
The other issues mentioned are simply data quality - the same whether viewing the dump or elsewhere.
Leaving this to James to schedule.