Lessons Learned in Content Architecture Harmonization and Metadata Models

Purpose - This paper aims to review key content, architecture, and metadata model decisions and strategies in creation of a publication portal (on DVD to start), based on a 30+ year series of flagship reports from the World Bank. Design/methodology/approach - The paper describes and analyzes key con...

Full description

Bibliographic Details
Main Authors: Wagger, S., Park, R., Bedford, D. A. D.
Format: Journal Article
Language:EN
Published: 2012
Online Access:http://hdl.handle.net/10986/5379
id okr-10986-5379
recordtype oai_dc
spelling okr-10986-53792021-04-23T14:02:22Z Lessons Learned in Content Architecture Harmonization and Metadata Models Wagger, S. Park, R. Bedford, D. A. D. Purpose - This paper aims to review key content, architecture, and metadata model decisions and strategies in creation of a publication portal (on DVD to start), based on a 30+ year series of flagship reports from the World Bank. Design/methodology/approach - The paper describes and analyzes key considerations and aspects of the project, including content architecture, content analysis, DTD selection, retrospective conversion, vendor management, design of metadata architectures, use of automated profiling methods, user-information behavior, and search architectures supporting complex content architectures. It includes the challenges of applying an institutionally based taxonomy required to express subject-matter responsibilities and relationships within the World Bank. Findings - The team learned that the metadata behavior and architecture (inheritance, relationships, variations) are more complex than simple links between parent and child objects. The project also reinforced the importance of comprehensive and dynamic topic taxonomy for classifying content that is both historical and current. The approach to defining classes for each full report (parent) will be likely to change, given what has been learned. The team would recommend that parts be classified and the sum of the part classes be assigned to the whole report. As a result of this exploratory work, the Bank's approach to classification and indexing of report series is changing from a top-down to a bottom-up inheritance. Originality/value - The study provides insights into both general and World Bank-specific challenges in creating a publication portal and derives some best practices for content architecture, metadata architecture, and use of automated profiling methods. 2012-03-30T07:32:33Z 2012-03-30T07:32:33Z 2010 Journal Article Aslib Proceedings 0001-253X http://hdl.handle.net/10986/5379 EN http://creativecommons.org/licenses/by-nc-nd/3.0/igo World Bank Journal Article
repository_type Digital Repository
institution_category Foreign Institution
institution Digital Repositories
building World Bank Open Knowledge Repository
collection World Bank
language EN
relation http://creativecommons.org/licenses/by-nc-nd/3.0/igo
description Purpose - This paper aims to review key content, architecture, and metadata model decisions and strategies in creation of a publication portal (on DVD to start), based on a 30+ year series of flagship reports from the World Bank. Design/methodology/approach - The paper describes and analyzes key considerations and aspects of the project, including content architecture, content analysis, DTD selection, retrospective conversion, vendor management, design of metadata architectures, use of automated profiling methods, user-information behavior, and search architectures supporting complex content architectures. It includes the challenges of applying an institutionally based taxonomy required to express subject-matter responsibilities and relationships within the World Bank. Findings - The team learned that the metadata behavior and architecture (inheritance, relationships, variations) are more complex than simple links between parent and child objects. The project also reinforced the importance of comprehensive and dynamic topic taxonomy for classifying content that is both historical and current. The approach to defining classes for each full report (parent) will be likely to change, given what has been learned. The team would recommend that parts be classified and the sum of the part classes be assigned to the whole report. As a result of this exploratory work, the Bank's approach to classification and indexing of report series is changing from a top-down to a bottom-up inheritance. Originality/value - The study provides insights into both general and World Bank-specific challenges in creating a publication portal and derives some best practices for content architecture, metadata architecture, and use of automated profiling methods.
format Journal Article
author Wagger, S.
Park, R.
Bedford, D. A. D.
spellingShingle Wagger, S.
Park, R.
Bedford, D. A. D.
Lessons Learned in Content Architecture Harmonization and Metadata Models
author_facet Wagger, S.
Park, R.
Bedford, D. A. D.
author_sort Wagger, S.
title Lessons Learned in Content Architecture Harmonization and Metadata Models
title_short Lessons Learned in Content Architecture Harmonization and Metadata Models
title_full Lessons Learned in Content Architecture Harmonization and Metadata Models
title_fullStr Lessons Learned in Content Architecture Harmonization and Metadata Models
title_full_unstemmed Lessons Learned in Content Architecture Harmonization and Metadata Models
title_sort lessons learned in content architecture harmonization and metadata models
publishDate 2012
url http://hdl.handle.net/10986/5379
_version_ 1764394843440676864