Ticket #1467 (closed defect: worksforme)

Opened 2 years ago

Last modified 2 years ago

CKAN dumps for DGU miss certain publisher information

Reported by: thejimmyg
Owned by: thejimmyg
Priority: major
Milestone: ckan-sprint-2012-01-09
Component: ckan
Keywords:
Cc:
Repository: ckan
Theme: none

Description

Pawel knows about this so David Read, Pawel and I need to find time to discuss it.

Change History

comment:1 Changed 2 years ago by dread

  • Owner changed from dread to jimmyg
  • Status changed from new to assigned

Leaving this to James to schedule

comment:2 Changed 2 years ago by kindly

  • Milestone changed from ckan-sprint-2011-11-21 to current-ckan-sprint-2011-12-05

comment:3 Changed 2 years ago by thejimmyg

  • Owner changed from jimmyg to thejimmyg

comment:4 Changed 2 years ago by rgrp

  • Milestone changed from ckan-sprint-2011-12-05 to current-ckan-sprint-2012-01-09

Moving to current sprint as this sprint is now long finished.

@jimmyg: please close, defer, update as necessary!

comment:5 Changed 2 years ago by thejimmyg

  • Status changed from assigned to closed
  • Resolution set to worksforme

The publisher issue seems to be resolved now, although during investigation I also found these issues:

  • 9 of the records don't have a "published by" value, and I wondered why
  • Lots of them are state=deleted (so do we really want to include these?)
  • We're still showing the deprecated agency field
  • Many of the departments are blank

Pawel is not available to work on these anyway at the moment, so let's pick them up as part of the disintegration work to migrate to CKAN. Marking the main ticket as "worksforme" since it does now.

comment:6 Changed 2 years ago by dread

I believe that the "publisher issue" that James alludes to is that the dump doesn't contain the 'parent publisher' field that is generated in the DGU system on the Drupal side. This information will be stored following the Groups Refactor #1477 and should be added to the dump at this point.

Excluding datasets that are state=deleted is a good idea. I've split that off into #1623.
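As a rough illustration of the state=deleted exclusion, here is a minimal sketch. It assumes the dump has already been parsed into a list of dicts, each with a "state" key; the function name exclude_deleted and the sample records are hypothetical and are not taken from the CKAN dump code.

```python
import json


def exclude_deleted(records):
    """Return only the records whose state is not 'deleted'.

    Assumes each record is a dict with an optional "state" key;
    records without a "state" key are kept.
    """
    return [r for r in records if r.get("state") != "deleted"]


# Hypothetical sample standing in for a parsed JSON dump.
sample = json.loads(
    '[{"name": "a", "state": "active"},'
    ' {"name": "b", "state": "deleted"},'
    ' {"name": "c"}]'
)

kept = exclude_deleted(sample)
print([r["name"] for r in kept])
```

Whether records with no state at all should be kept is a design choice; this sketch keeps them so that only explicitly deleted datasets are dropped.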

The other issues mentioned are simply data quality - the same whether viewing the dump or elsewhere.

comment:7 Changed 2 years ago by dread

'parent publisher'

Sorry, I meant 'parent department'
