Skip to main contentdfsdf

Kirsten McKnight's List: Records and Information Managementimport from google notebook

  • Apr 29, 09

    Discusses the value of off site storage and the costs of storing records on site

    • Risk management is critical to reducing exposure to litigation, audit and statutory compliance.
    • if no formal records management program is in place, “between 30 and 60 percent of its records are either inactive or semiactive and need to be destroyed immediately or transferred from prime office space to a low-cost storage facility.”

    1 more annotation...

  • Apr 23, 09

    Article about the importance of good legal holds processes particularly around exiting employees

  • Apr 23, 09

    How will we know what to preserve? Arguably, diaries, letters etc are of interest particularly when then individual in question is of interest to the public for whatever reason - but should things like GChat, Twitters be preserved. How do you determine what is of value? I don't think my diaries ranging back to when I was in grade school are of any value, nor transcripts of my chats or twitters or whatever.

    • Comments
      • Loving some of these comments!

      Add Sticky Note
    • All vice presidents choose where to keep their official papers, which are handed  over to the National Archives and Records Administration when they leave office.

    2 more annotations...

    • The U.S. Justice Department is urging Congressional leaders to  require mortgage lenders and servicers to retain borrower records for up to 10  years, as part of an effort to make it easier to prosecute fraud
    • But the collapse of the mortgage industry has hindered investigations, she  contended, as lenders, brokers and even title companies are largely going under  at a rapid rate and/or haven’t retained borrower records. In terms of  “subprime,” the catch-all that now seems to be synonymous with shady business  practices, 5 of the 10 top U.S. originators in the subprime market have since  folded or otherwise been acquired.
      • As far as I am concerned, here is a great argument for regulation of the banks and lending institutions. Gasp!

      Add Sticky Note
    • it  is offering a free personal health record on the Microsoft  HealthVault platform
      • Google and MS - head to head!

      Add Sticky Note
    • The Mayo Clinic Health Manager, as the PHR is known, will  provide not only a secure place to store medical records online, but also  guidance from Mayo experts that will be tailored to the information contained in  that PHR. If you are a 50-year-old man with diabetes and hypertension, for  example, you will receive information about how you should take care of those  conditions and the tests you should receive.
      • Still should be concerns around privacy, security, access, retention... granted this is not an in-depth article.

      Add Sticky Note

    3 more annotations...

  • Apr 22, 09

    Cloud computing, also known as Software as a Service (SaaS), according
    to one attorney is a corporate counsel nightmare. From a Canadian
    perspective I think one of the major issues is the Patriot Act - if you choose
    to use these services, such as Google, Yahoo, their servers are all
    based in the US and therefore any information stored using these services
    is subject to the Patriot Act. Major privacy implications....

    • The 2006 e-discovery amendments to the Federal Rules of Civil Procedure  (FRCP)  changed the legal and corporate information landscape, putting custody and  control at top of mind.
    • And that creates a big challenge for corporate counsel. How can they identify  "who, when and where" in the cloud? How can organizations handle document  retention? And to add another layer of worry, information targeted for the cloud  may also be subject to laws requiring privacy and persistent data integrity, and  other requirements that the storage manager may not even be aware of.
      • Not to mention if you are a Canadian company do you
        really want to be subject to the Patriot Act?

      Add Sticky Note

    2 more annotations...

  • Apr 22, 09

    An article with some interesting ideas about how to manage email in a health/patient environment. As pointed out by one reader though, email needs to be managed by its content; a crucial and key concept!

  • One Picture, 1,000 Tags

     
    By PAMELA LiCALZI O’CONNELL
    Published: March 28, 2007

    WHAT do you see when you see a work of art?

    Museums

    From the Bowery to the Bay Area, museums are expanding up and out in a frenzy of new construction and renovation.

    Strolling through a museum, a painting of a shipwreck catches your eye. You are struck by the dominance of blue in another work. Yet another painting, featuring a silvery moon, seems sad.

    If you try to find those paintings on the museum’s Web site you will probably fail unless you know the title or artist. You can’t search based on what you see.

    “Museums have recognized that their online collections are not doing the job — we’re hiding the content away from nonspecialists,” said Jennifer Trant, a partner at Archives and Museum Informatics in Toronto. “We’ve got to provide access on the same level as visual memory.”

    Now, after spending millions of dollars and years of effort on their virtual homes — which draw many more visitors than their physical ones — museums are rethinking their online collections. They are experimenting with one of the hottest Web 2.0 trends: tagging, the basis for popular sites like Flickr.com. In social tagging, users of a service provide the tags, or labels, that describe the content (of photos, Web links, art), thus creating a user-generated taxonomy, or folksonomy, as it’s called.

    Museums plan to encourage the public to annotate their collections by supplying descriptive tags that could exist alongside professional documentation, creating a new shared vocabulary. Van Gogh’s “Starry Night,” for example, could elicit tags like “stars,” “planets,” “swirls” or “insanity.”

    The Cleveland Museum of Art, the Smithsonian Institution and the Powerhouse Museum in Sydney, Australia, already have prototype tagging applications on their Web sites, and nearly a dozen other museums plan similar projects.

    But can the public be trusted to tag art? Will curators let them?

    The Metropolitan Museum of Art ran a test in fall 2005 in which volunteers supplied keywords for 30 images of paintings, sculpture and other artwork. The tags were compared with the museum’s curatorial catalog, and more than 80 percent of the terms were not in the museum’s documentation. Joachim Friess’s ornate sculpture “Diana and the Stag,” for example, was tagged with the expected “antler,” “archery” and “huntress.” But it was also tagged “precious” and “luxury.”

    “The results were staggering,” said Susan Chun, general manager for collections information planning at the Met. “There’s a huge semantic gap between museums and the public.”

    Based on this and other research, a group of museums formed the steve.museum tagging project, which recently received a two-year grant from the Institute of Museum and Library Services. The grant work, which began last fall, is based at the Met and the Indianapolis Museum of Art, and includes the Cleveland Museum of Art, the Denver Art Museum, the Guggenheim Museum, the Minneapolis Institute of Arts, the Rubin Museum of Art in Manhattan and the San Francisco Museum of Modern Art. People may tag selected art from these museums on the project Web site; some of the museums plan applications on their own sites as well.

    Aside from the prohibitive cost of subject indexing thousands of works, there are other reasons museums want the public to tag art. For one, “art professionals can find it surprisingly difficult to describe the visual elements of a picture,” said Ms. Trant, who is managing the grant work. She recalled that during early testing of tagging at the Met, a frustrated curator complained, “Everything I know isn’t in the picture.”

    “We would never say a work is mostly red, or instills a sense of ennui, or features a dog playing poker,” agreed Bruce Wyman, director of new technologies for the Denver Art Museum. “Tagging gives us a set of eyes we don’t have.”

    Since August 2006, the Smithsonian Photography Initiative has asked visitors to its Web site, photography.si.edu, to “Enter the Frame” and label 2,000 images culled from various archives. The tags typed in by users become immediately visible to them, but are not added to the database until a professional has reviewed them.

    “Our keywording was insufficient in a lot of ways,” said Effie Kapsalis, senior digital producer of the site. “There’s no taxonomic system that could cover the subjects of all these photographs. And we want a lot of tags for each image. So that’s why we turned to the public.”

    The tags range from the obvious and mundane to the impressionistic and personal. A photo of Greta Garbo was labeled “lonely” while one of boys dressed in Civil War uniforms inspired “innocence.” The tags create sometimes instructive, sometimes amusing links between disparate images, and these unexpected connections make the photos easy to browse through in a way museum sites rarely do.

    Will the public want to tag art? Most large museums have a healthy volunteer corps, and art lovers in general might jump at the chance to assist in museum work. There’s also a powerful Web ethos that spurs participation in “collective intelligence” projects, like the user-written Wikipedia. And, as tagging projects have revealed so far, some people also are likely to use tags to proclaim a personal connection with a work of art.

    A look at the tags on the steve.museum site reveals that for each work there is a tendency for a small number of tags to be assigned frequently and for a large number of terms to be assigned once. Take the case of John Singer Sargent’s “Madame X.” After the top five most common labels (woman, black dress, portrait, table, gown), the tags taper off into a long list of increasingly subjective terms (aristocrat, stiff, daring, snob, scandalous, etc.).

    If a tag is a user’s assertion that a work of art is about something, as the proponents of tagging suggest, then clearly everyone has a different idea of what “something” is. That’s a good thing, said Sebastian Chan, manager of Web services for the Powerhouse Museum in Sydney, which, since its site was redesigned last June, has encouraged tagging (powerhousemuseum.com). Paul McCarthy, a digital media manager in Sydney, has tagged numerous images on the Powerhouse site, often using lingo and street nomenclature, like the nickname “spacies” for the arcade game Space Invaders.

    “There’s real power in idiomatic lingo,” Mr. McCarthy said via e-mail. Tagging, he added, “unleashes the power of the vernacular. It brings the collection alive.” 

  • Cuisinarts, E-Commerce, and ... Controlled Vocabularies

    By Lou Rosenfeld,  Dr. Dobb's Journal
    Jan 01, 2002
    URL:http://www.ddj.com/184412772


    My friend related this interesting e-commerce experience to me recently.

    He wanted to buy a food processor. Being the sensible guy that he is, he wanted quality but didn't want to pay an arm and a leg for it. A used Cuisinart would fit the bill quite nicely. So my pal fired up his browser and went on his happy way to eBay, where he proceeded to enter a search for exactly what he thought he was looking for:

    "Cusinart."

    Now, my friend is a pretty bright guy. He has a PhD and a host of publications to his credit. He's been a successful entrepreneur and academic. Nowadays he's helping the federal government get up to speed on e-commerce. So misspelling Cuisinart is obviously independent of intelligence—something that could happen to anyone.

    It gets better. He submits his search and, voila, eBay has a "Cusinart" available. He bids successfully, and the misspelled "Cusinart" shows up on his doorstep a few days later.

    You think that's the end of story. It's not. My friend eventually realizes his mistake and out of curiosity goes back to eBay. This time he searches for the correctly spelled "Cuisinart" and finds lots of them listed—as well as lots of bids pending. More bidders mean higher prices. The upshot is that if my friend had spelled the word correctly in the first place, he'd probably would have spent roughly triple the amount for a "Cuisinart" rather than his bargain-priced "Cusinart."

    So what's my point? Well, it's not so much about how to make the most of your eBay experience (besides, there are already whole books on playing eBay to your advantage). Instead, it's just another illustration of how using controlled vocabularies can be the difference between a productive, efficient site and ... well, giving away your old Cuisinart at a third of its worth.

    Controlled vocabularies and the guessing game

    Whether you realize it or not, you're already familiar with controlled vocabularies. The Library of Congress subject headings and Yahoo's search criteria are a couple of examples. So, as you've probably guessed by now, controlled vocabularies are predetermined sets of terms that fit together to describe a specific domain such as kitchen appliances, nuclear engineering, or dirt biking.

    The terms are standardized because language is ambiguous. People use different terms to say the same thing all the time. Or, worse yet, the same terms can mean different things. Sometimes folks just honestly screw up—like my friend did.

    By predetermining the terms that make up a controlled vocabulary, and using those terms to describe your site's content, you can minimize the negative effects that variants, synonyms, and various other annoyances can have on your site and its users.

    Here's an example. Let's say you're a webmaster at AT&T. Your site describes a huge host of products; one of them is the One Rate plan. There are many pages in your site that deal with One Rate. The problem is that there isn't a standardized spelling for it. So, some pages are about the "One Rate" plan, others describe it as "1 Rate" and on and on. Here are some of the possible references:

    • One Rate Plan
    • OneRate Plan
    • 1 Rate Plan
    • The One Rate Plan

    In this case, a patient user might eventually guess the right variant. But, as we all know, patience is a rare commodity on the Web. Without an effective controlled vocabulary strategy, users who enter the wrong term would find nothing. Consequently AT&T would lose a number of new business opportunities. This is where e-commerce degrades into e-guessing.

    The guessing gets a lot more difficult with synonyms. Let's say a user visits a financial services site to find information on "income dividends." Would he or she have guessed that much of the content was listed under the following synonymous terms?

    • Dividend income
    • Income returns
    • Investment income

    It's unrealistic to expect users to even bother trying to make guesses. Which, if you've invested any amount of time or money into building your web site, is really bad news.

    Users who are browsing a site benefit from controlled vocabularies because they will find the information they need in one place under one heading. It also makes your life as a webmaster easier since you don't have to make up a new label for each piece of information you want to add to your site—just choose from the controlled vocabulary.

    An even better approach (though more work) is to consider expanding your controlled vocabulary into a thesaurus that includes variant, related, broader, and narrower terms, as well as glossary definitions. That way, a user browsing your site could, for example, look for "investment income" and be directed to content indexed under the standard term, "income dividends." Searching can be similarly improved—the user's query, "1Rate," would be enriched to automatically include "One Rate" and the variants that have already been mapped to "One Rate," terms that the user hadn't considered.

    Resources

    If you are going to explore using a controlled vocabulary, here are a few resources. Visit the American Society of Indexers web site, which includes a listing of web-based resources. You might even consider joining ASI, or hiring its members. You might also read Peter Morville's Web Architect column on developing a thesaurus where he defines "a controlled vocabulary that leverages synonymous, hierarchical, and associative relationships among terms to help users find the information they need."

    Tips

    • Consider using multiple controlled vocabularies to describe the same content. Organize each around a theme like product names, subjects, processes, audience types, and so on. Avoid mixing these themes together because that's mixing apples and oranges, making them confusing for users to understand and more difficult for you to maintain.

    • Balance the desire to use multiple vocabularies with the overhead involved in applying and maintaining each. Just as the domain changes over time, so should the terms used to describe it.

    • Plagiarize (I mean, borrow from) good vocabularies that are already found in similar web sites, not to mention the examples you'll find linked from the ASI site. You might even find some useful candidates in your organization's printed materials.

    • As described above, automatically enriching a search query behind the scenes can improve retrieval. But that doesn't mean it should be transparent to users who might not understand what's happening. Always provide at least a basic explanation of how your site's searching system works.

    • Finally, remember that a good controlled vocabulary not only describes a domain's content, but should also reflect the language of users. If you don't have a good feel for the kinds of words your site's users commonly use, then analyze your search engine's query log.

    Junk in, junk out

    Looking around the Web and you'll find a wasteland of chaotic vocabularies. It's almost amusing: with all the hype about e-commerce these days (somehow I remember commercial web sites existing long before I ever heard that term), webmasters are going nuts for taxonomies to describe their products and services. But you don't hear too much about what terms should be used to actually populate those taxonomies.

    Same thing goes for the corporate portal and the Yahoo-ized intranet. Vendors of portal software, XML-based approaches, and other products espouse metadata as solutions to the challenges of searching and browsing. Again, their solutions are only halfway there. They provide you with descriptive metadata fields, but you'll still need standardized terms to enter into those fields. Otherwise, it's simply another case of junk in, junk out.


  •  Taxonomies

    Value of Organized Knowledge

    by Jack Bryar
    21-Jan-2002


    An Old Problem Gets Worse

    In recent years, the volume of news and information resources available to the typical corporate employee has grown exponentially. Corporate Web traffic has jumped by over 600% annually. Web-available content exceeds several billion items. Executives frequently receive more than 200 e-mails a day. The amount of corporate data generated per employee doubles every 18 months.

      "If you printed the information available through our Intranet, it would stretch from the earth to the sun."
      -- Marc Auckland, World-Wide Chief Knowledge Manager, British Telecom

      Corporate managers are worried. Sixty percent say that info-glut is having a negative effect on productivity. IDC estimated that in 1999, US Fortune 500 companies lost $12 billion due to an inability to locate knowledge resources amidst all the clutter. Eighty percent of executives believe the problem will get worse before it gets better.

      Adding to the strain is the fact that this content is so difficult to access. Corporate information exists in many forms. Each form (electronic news, email, databases, Web pages, archived documents, etc) resides in its own format, accessible only though some unique index system. Often content is not easily accessible. Much of it is scattered across the enterprise.

      Yet, access is critical. New applications have sprung up requiring access to information across the enterprise, sometimes across multiple businesses. These include next-generation customer care, competitive research, and B2B transactions. In order to get at the information needed to run these applications, information itself needs to be re-structured, and re-organized -- and so does the method of getting at that information.

      Info-Illiteracy: A Barrier to Finding Information

      The problem of business info-glut is worse than it appears. Many employees lack the skills needed to find the information they require.

      For years, putting tools in the hands of the users was considered the best way for companies and their knowledge workers to get their hands on the information they needed. In most cases, that has meant providing users with a search engine similar to systems found on public Websites. Today, many knowledge workers have to navigate as many as six different search engines and database indexes each day.

      New research casts doubt on how well search engines works for most users. A study of AltaVista users revealed a surprising amount of info-illiteracy. According to that study:

      • 80% couldn’t/wouldn’t build a working Boolean search
      • 87% used less than 3 words

      A big part of the problem is that the same term can have different meanings to different people. Not knowing which terms will uncover sought-after information is a significant barrier for many knowledge workers. Any successful strategy for managing information has to overcome this problem.

        In 1814, Thomas Jefferson was so dissatisfied with the ruined and disorganized state of the documents at the Library of Congress that he donated his collection and then personally reclassified the all the books there.
        -- Source: Systems of Knowledge Organization for Digital Libraries, Gail Hodge

        XML to the Rescue?

        The Internet has been described as the world’s largest library, with the books thrown all over the floor. Many corporate information systems look just as disorganized. Information managers are convinced that the best solution to this clutter involves wrapping up all electronic document forms inside a common format, so that the content inside can be more easily found, and used by different applications.

        The wrapper being used by most organizations today is XML. XML allows the tagging of a document with a description of what the document is about, and where it came from. Searching on XML meta-tags can certainly simplify the search process.

        Unfortunately, XML does not solve the problem of finding information. It only standardizes the problem. It requires that any XML tagging system clearly understand what the document is about, and it needs to anticipate the search process someone might try to use to find it. This takes time, a great deal of sophistication, or both. Otherwise, the process results in hiding essential documents behind generic, idiosyncratic or meaningless tags, making the information management and retrieval problem even worse.

        In order for XML tagging to be meaningful for search and retrieval, the terms used to tag content have to be intuitive enough to encourage their use by information-seekers. They should be structured in a standardized way; less as a set of variable keywords and more like a set of subject categories. These subject categories should be set up in a hierarchical fashion, with logical subtopics and overviews. This, in short, is a taxonomy.

        Enabling an ability to search or manipulate content, "by category" is an essential benefit of a successful XML tagging process.

        Taxonomies Defined

        Taxonomies are sometimes called "classification schemes" or "categorization schemes." Each refers to grouping together similar items into broad "buckets" or "topics" which themselves can be grouped together in ever-broader "hierarchies." Examples of taxonomies include systems as diverse as the Dewey Decimal system found in small libraries, Yahoo’s Subject Index, and the massive taxonomic system proposed by Linneaus used by generations of biology students. Wherever they are used, they have the same goal -- to organize knowledge about a given subject.

        A sample taxonomy from NewEdge:

        Taxonomies and The Search Process

        Perhaps the greatest benefit to taxonomies is improved searching.

        Properly constructed taxonomies simplify the process of gathering "the right" information for daily business use by simplifying the vocabulary used in the search process. Tagging systems using raw key words or similar strategies are likely to generate search error rates approaching that of straight text searches. For example, while a search on the word "DSL" will find stories on a particular type of broadband technologies, it will miss others, and may accidentally find content referring to Dutch sign language or Data SubLanguage.

        A better approach would be to define these documents as belonging to the subject category, "Digital Subscriber Line." If the searcher can focus on a proven set of categories rather guess at keywords, chances of finding the right content, are far greater, and the process will be faster and more reliable.

        The most important contribution of taxonomies to the search process is that they work.

        Even using a relatively primitive taxonomic system, Microsoft reported a 40% improvement in hit rates. Satisfaction metrics doubled. In addition, the time spent trying to find a given document was significantly reduced. The success rate of taxonomic-based searching reduces the strain on systems and on the people who use them.

        Business is Complicated

        Naturally, one of the most important criteria for taxonomy is that it should be easy to navigate. But building solid taxonomies is much easier said than done. Consider, for example, a taxonomy of business subjects.

        Businesses vary in size and have multiple points of focus. Business activities involve an array of subjects that do not always fall into logical groupings. Subject boundaries are often fuzzy.

        Subject hierarchies can feel artificial, as content, particularly business critical content, may fall into multiple categories. Indeed, most executive-level business documents involve several categories. Traditional indexing schemes dissolve in complexity as the number of unique concepts grows.

        So, while some subjects are relatively easy to categorize, most business functions are not. (I should know: NewsEdge has spent several years developing a proprietary business taxonomy). Nevertheless, you should seriously consider developing a taxonomy for the content management system residing underneath your e-business efforts in general, and your Intranet in particular. Your content contributors and end-users alike will be grateful.

         

        Next:

        Send Feedback

        See all ECM Channel feature articles.

        Need to select a technology vendor, but confused about your choices? See our vendor-neutral technology reports.

        Join the conversation

        Digg This | Search Technorati | Tag it on Del.icio.us


        About the Author

        Jack Bryar is a Practice Leader in the Knowledge Management Consulting and Editorial Services unit of NewsEdge Corporation. Bryar has helped NewsEdge clients to determine their information needs in industries ranging from energy production to international banking.  Prior to NewsEdge, Bryar led the I.T. Practice of a corporate advisory services consultancy based in Toronto, Canada.




  • Barbara Blackburn
    5/31/2006

    So you’ve decided you need a taxonomy to categorize and organize your documents and records. But how do you decide what type of taxonomy to design? The type of taxonomy you choose is as important as the taxonomy itself. If the design doesn’t meet the needs of the users; it will not be used.

    Taxonomy Types
    Taxonomies are usually hierarchical. Categories (nodes) in the hierarchy progress from general to specific. Each subsequent node is a subset of the higher level node. There are three basic types of hierarchical taxonomies: subject, business-unit, and functional.

    A subject taxonomy uses controlled terms for subjects. The subject headings are arranged in alphabetical order by the broadest subjects, with more precise subjects listed under them. An example is the Library of Congress Subject Headings (LCSH) used to categorize holdings in a library collection (see the example to the right). The Yellow Pages could be considered a subject taxonomy.

    It is difficult to establish a universally recognized set of terms in a subject taxonomy. If users are unfamiliar with the topic, they may not know the appropriate term heading in which to begin their search. For example, a person searches through the Yellow Pages for a place to purchase eyeglasses. They begin their search alphabetically by turning to the E’s and scanning for the term “eyeglasses.” Since there are no topics titled “eyeglasses,” the person consults the Yellow Pages index, finds the term eyeglasses, which provides a list of preferred terms or “see also” which directs the person to “optical – retail” for a listing of eyeglass businesses.

    LIBRARY OF CONGRESS SUBJECT HEADINGS

  • H -- SOCIAL SCIENCES
  • J -- POLITICAL SCIENCE
  • K -- LAW
  • L -- EDUCATION
  • M -- MUSIC AND BOOKS ON MUSIC
  • N -- FINE ARTS
  • P -- LANGUAGE AND LITERATURE
  • Q – SCIENCE
  • R -- MEDICINE
  • - Subclass RA Public aspects of medicine
  • - Subclass RB Pathology
  • - Subclass RC Internal medicine
  • - RC31-1245 Internal medicine
  • - RC49-52 Psychosomatic medicine
  • - RC251 Constitutional diseases (General)
  • - RC254-282 Neoplasm. Tumors. Oncology
    Subject Taxonomy Sample

    COUNTY GOVERNMENT BUSINESS-UNIT TAXONOMY

  • Assessor
  • Building
  • Commissioners
  • Coroner
  • District Attorney
  • Finance
  • Health and Environment
  • Human Resources
  • Human Services
  • Motor Vehicle
  • Clerk and Recorder <------------
  • Department
    • Elections <------------ Divisions
    • Motor Vehicle
    • Recording

    • - TD1000 <-------- Records
    • -- Warranty Deed
    • -- Quit Claim Deed
    • -- Subdivision Plat
    • Sheriff
    • Treasury
      Business-Unit Taxonomy Sample
      In both examples (LCSH and Yellow Pages), the subject taxonomy is supported by a thesaurus. A thesaurus is a controlled vocabulary that includes synonyms, related terms, and preferred terms. In the case of the Yellow Pages, the index functions as a basic thesaurus.

      In a business-unit taxonomy, the hierarchy reflects the organizational charts (e.g., department/division/unit). Records are categorized based on the business unit that manages them. The example above and to the right shows the partial detail of one node of a business unit taxonomy that was developed for a county government.

      One advantage of a business-unit taxonomy is that it mimics most existing paper- filing system schemas. Therefore, users are not required to learn a “new” system. However, conflicts arise when documents are managed or shared amongst multiple business units. For example, in the county government example above, a property transfer document called the “TD1000” is submitted to the recording office for recording and then forwarded to the assessor for property tax evaluation processing. This poses a dilemma as to where to categorize the “TD1000” the taxonomy. Another issue arises with organizational changes. When the organizational structure changes, the business-unit taxonomy has to change as well.

      In a functional taxonomy, records are categorized based on the functions and activities that produce them (function/activity/ transaction). The organization’s business processes are used to establish the taxonomy. The highest or broadest level represents the business functions. The next level down the hierarchy constitutes the activities performed for the function. The lowest level in the hierarchy consists of the records that are created as a result of the activity (e.g., transactions).

      The example above shows partial detail of a functional taxonomy developed for a state government regulatory agency. The agency organizational structure is based on regulatory programs. Within the program areas are similar (repeated) functions and activities (e.g., permitting, compliance, enforcement, etc.). When the repeated functions and activities are universalized, the results are a “flatter” taxonomy. This type of taxonomy is better suited to endure organizational shifts and changes. In addition, the process of universalizing the functions and activities inherently results in broader and more generic naming conventions. This provides flexibility when adding new record types (transactions) because there will be fewer changes to the hierarchy structure.

      One disadvantage of a functional taxonomy is its inability to address case files (or project files). A case file is a collection of records that relate to a particular entity, person, or project. The records in the case file can be generated by multiple activities. For example, at the regulatory agency, enforcement files are maintained that contain records generated by enforcement activities (Notice of Violation, Consent Decree, etc.) and other ancillary, but related activities such as contracting, inspections, and permitting. To address the case file issue at the regulatory agency, metadata crossreferencing was used to provide a virtual case-file view of the records collection.

      ARTWORK GOES HERE Which Taxonomy Type Should You Use?
      Each taxonomy type has its pros and cons. In most cases, a Hybrid approach combining the taxonomy types is the most appropriate.

      In choosing a taxonomy type, consider the following:

      • Gain an understanding of your organization and how the business units function and interact.
      • What are the needs of the users (both internal and external users)? Will you need multiple “views” or methods for records searching and categorization?
      • Where will the taxonomy be applied and what are the operating parameters or limitations of those systems? (electronic content or records management system, paper files, shared network drive, website, etc.)
    • Barbara Blackburn is a senior consultant with IMERGE Consulting (www.imergeconsult. com). She specializes in document and records management technology planning, selection, and deployment. Ms. Blackburn is also an instructor for AIIM’s ERM Certificate program. Contact her at (barbb@imergeconsult.com).

  •  Information Connection: Data Stewardship Using the Zachman Frameworkby Jonathan G. Geiger

    Data stewardship is a multi-faceted function which places the ultimate organizational responsibilities for data quality. Frequently, people refer to a data steward, as though a single person can perform all the activities. While a single person or group of people may ultimately be responsible for data quality, the stewardship functions are carried out by several people. The Zachman Framework perspectives are useful for determining the organizational responsibilities for different aspects of data stewardship.

    The planner perspective provides an enterprise view. In this view, the major subjects of the organization are identified. One way of identifying the major subjects is to bring together business people from each of the major functional areas of the company so that the major subjects can be identified. A typical list of major subjects would include customers, products, competitors, facilities, human resources, materials, etc. Agreement on the list of major subjects, and the scope and definition of each, needs to be gained at an enterprise level. The business people who were instrumental in defining the major subject areas can form the nucleus of a stewardship council which would provide an enterprise view toward the data.

    In addition to defining the subjects and their content, this council would also set the policies concerning the management of data as an enterprise resource. These policies would address data creation, maintenance, dissemination (including security) and disposition from a generic viewpoint, with the responsibility for executing the policies for individual data elements or subjects being under the control of the appropriate people.

    In the owner perspective, the entities and attributes within each subject area are identified and defined from a business perspective. The relationships between pairs of entities and the domain of business values are also defined in the owner perspective. To a large extent, the work performed in this perspective is done within individual subject areas. Hence, it is appropriate to designate, for each subject area, a business person as the data steward to oversee the development of the owner perspective. The stewardship council would provide guidance for matters involving multiple subject areas or if consensus cannot be reached on the subject area within which an entity belongs.

    Data analysts within data administration should be responsible for developing the (system) data model which governs the technical implementation of the business model. Data administration defines the overall naming standards and, therefore, may need to rename some of the entities for representation within the physical databases. Interaction with the business people is warranted to ensure that the resultant model is semantically consistent with the business model. If discrepancies arise, both parties need to be part of the resolution.

    Once the system model is defined by the data analyst, a person within database administration must define the physical tables which preserve the integrity of the system model and meet the designated performance requirements. This is done in the builder view.

    In the builder view, physical databases and tables are defined. The technical model used in this view may differ from the system model because of access or performance considerations. As with other views, some collaboration may be needed between data administration and database administration. Database administration needs to ensure that the referential integrity implied in the system model is preserved. The database administrator is then ready to create the physical databases and tables in the subcontractor view.

    The physical databases are created by database administration in the subcontractor view. With modern repositories and CASE tools, consistency with the builder view is easy to guarantee, as the creation of the physical environment can be automated. The product of the process is the actual database which is eventually populated.

    The final product is the set of populated tables. Data for these tables is often provided by individuals, and these individuals need to be accountable for the data entered.

    The data stewardship roles exist to ensure the integrity of the product--the populated databases and tables. Without this integrity, the value of any database is very limited. Throughout the process, the stewardship functions are performed by different people. First, a group of people defines the major subjects of interest. These people need to have an enterprise-wide business perspective. Then the contents of each subject area of interest are defined by a person with a specific business perspective. The result is translated into an electronic representation of the business by a person with business and data analysis skills who applies information resource management principles to make the transformation. Finally, a person with technical database skills defines and creates the physical tables.

    These functions are designed to help ensure that the data entered meets the enterprise needs. However, unless the person entering the data is also concerned with the data quality, the data may pass all the electronic validation rules and still be invalid.

    This column focuses on the data stewardship responsibilities within the data dimension of the Zachman Framework. The steward also addresses the other dimensions. For example, the processes which are defined for ensuring the data quality are declared within the process dimension, the physical distribution of the data is declared within the locations dimension, the security rules are declared within the people dimension, the currency requirement and retention rules are declared within the times dimension, and the governing business rules are declared within the motivations column.


    Jonathan G. Geiger is executive vice president at Intelligent Solutions, Inc.  Geiger has been involved in many corporate information factory and customer relationship management projects within the utility, telecommunications, manufacturing, education, chemical, financial and retail industries.  In his 30 years as a practitioner and consultant, Geiger has managed or performed work in virtually every aspect of information management.  He has authored or co-authored numerous articles and three books, presents frequently at national and international conferences, and teaches several public seminars.  Geiger may be reached at JGeiger@IntelSols.com.

    Copyright 2007, SourceMedia and DM Review.

  • DATA STEWARD ROLES & RESPONSIBILITIES
    Written by Robert S. Seiner
    KIK Consulting & Educational Services, LLC & TDAN.com
    [Return to the Article Archive]

    © Copyright 2005 - Robert S. Seiner  

    Published in TDAN.com July 2005 

    Introduction

    If you define Data Stewardship the same way that I define Data Stewardship, then we are apt to agree that defining data steward roles and responsibilities is at the heart of best practices for implementing a successful data stewardship program.

    The definition that I use for Data Stewardship is “the formalization of accountability for the management of organizational data”.  By stating “formalization of accountability” I mean that there are already “de facto” stewards of data in every organization that informally steward the data from several perspectives.  The way I see it, you wouldn’t have any data if “someone” wasn’t responsible for defining what data was/is needed, “someone” wasn’t responsible for producing the data per the definer’s specifications, and “someone” wasn’t going to use the data in the first place.  These people that define, produce and use the data are the “TRUE” data stewards of the organization.

    The questions are: Do they recognize themselves as data stewards?  Are they recognized as data stewards?  If they are already recognized as data stewards – do they know what it means to be a data steward or what a data steward is “really” supposed to do?  Is there anybody to assist them in becoming the “best” steward they can be by providing them a framework, discipline and education on how they and others can best manage the data of the company?  Is there anybody to guide them through paying specific attention to stewarding the data that they are accountable for?  Data steward roles and responsibilities are a critical component to successfully implementing a data stewardship program.

    As an easy way to articulate data steward roles and responsibilities, I typically break the roles and responsibilities into two different categories – 1) Data Stewardship Program Roles – pertaining to those individuals that define, design and execute the administration of the data stewardship program and 2) the Participating Data Steward Roles – pertaining to individuals that are responsible for daily operations and the management (definition, production and usage) of enterprise data.  The Program Roles are often overlooked and seldom become a focus of a data stewardship program.  I believe this oversight is a mistake and thus in this article I have paid specific attention to defining both the Program and the Participating data steward roles and responsibilities. 

    There is also a role for a person(s) that you may have initially identified or assigned “THE” data steward for a specific datum or set of data in your company.  Read on for a further description of where I see “THE” data steward fitting into the picture (and believe me, they play a critical role).

    Above the Line, Below the Line

    The next portion of the article defines the different type of roles and responsibilities that you may want to consider to include in the definition of both the program and participating roles.  Figure 1 below displays a separation of the Program Roles and the Participating Steward Roles - which I refer to as the "above the line" roles and the "below the line" roles.


    Figure 1 - Steward Roles Above and Below the Line

    Data Stewardship Program Roles

    The Data Stewardship Program Roles focus on setting up the Data Stewardship Program.  The individuals that play the program roles are not the Data Stewards themselves and most often the only data that the program roles are responsible for is the data about the stewards and the data about the data they steward.

    I describe these roles to be "above the line" drawn across the middle of Figure 1.  These roles focus on understanding the requirements for the data stewardship program, understanding the desired outcome of the data stewardship program, and they focus on architecting the best possible data stewardship solution for the organization.  More often then not the program roles originate in the IT organization even though the data stewards themselves reside in the business organizations.  The Data Stewardship Program Roles are responsible for finalizing participating steward roles and responsibilities, defining how data stewards will be "recognized" (not assigned), defining the database for collecting information about data stewards, and disseminating the data steward information (meta-data relating stewards to the data they steward) to those individuals that will benefit from using the stewardship meta-data.

    Data Stewardship Program Manager

    As part of the Program Roles, the Program Manager operates, manages and oversees the entire Network and Program.  The Program Manager coordinates the efforts of the Project Manager, Subject Matter Managers, and the Participating Data Stewards (see below for descriptions of these roles).

    Program Manager Responsibilities (may delegate to Project Manager defined below):

    • Working with the Project Manager to design and implement the Program for the company according to the company’s best practice definition of their data stewardship program.

    • Working with the Project Manager to create and provide educational materials and guidance to Subject Matter Managers and Participating Data Stewards.

    • Working with the Project Manager to identify (with the Subject Matter Managers) the appropriate Participating Data Stewards including Data Definers, Data Producers and Data Users as well as Subject Matter Managers.

    • Working with the Project Manager to ensure that the data stewardship information is available and accessible through the a stewardship meta-data repository that contains information that links Participating Data Stewards to the data they steward.

    • Working with the Project Manager to ensure that the processes pertaining to data stewardship information in the are followed to assure that the program is current/active and that the data stewardship information maintained in the is kept up-to-date.

    Data Stewardship Project Manager

    The Project Manager (“Project Manager”) will work in cooperation with the Program Manager to coordinate the efforts of the Subject Matter Managers and the Participating Data Stewards.

    Project Manager Responsibilities (may delegate to an Analyst assigned to the Program Manager):

    • Working with the Program Manager to design and implement the Program for the company according to the company’s best practice definition of their data stewardship program.

    • Creating and providing educational materials and guidance to Subject Matter Managers and Participating Data Stewards.

    • Identifying (with the Subject Matter Managers) the appropriate Participating Data Stewards including Data Definers, Data Producers and Data Users as well as Subject Matter Managers.

    • Ensuring the data stewardship information collected is available and accessible through the repository.

    • Ensuring the Data Stewardship Change Management procedures are followed to assure that the program is current/active and that the data stewardship information recorded in the stewardship repository is kept up-to-date.

    Participating Data Stewardship Roles

    Participating Data Stewards have specific responsibilities depending on their relationship to the data within their respective business areas or departments.  These Participating Data Steward roles are divided into three primary categories including Data Definition Stewards (Data Definers), Data Production Stewards (Data Producers), and Data Usage Stewards (Data Users).

    By identifying, recording and maintaining information about the Data Definers, Data Producers, and Data Users, your company can formalize the entire “chain of custody” and accountability of data for the organization.  The following is a brief description of the Participating Data Stewardship Roles.

    Subject Matter Managers (SMM)

    Housed in the Business areas within your company (depending on the subject matter of data being stewarded), Subject Matter Managers are responsible for coordinating Participating Data Steward activities with the Program Manager and Project Manager to address data quality issues and concerns regarding the data in that subject matter.

    The Subject Matter Manager is often identified as "THE" data steward for data that falls under their subject area.  BUT ... This person is often somewhat removed from the day-to-day operations of defining, producing and using the data therefore they are not "THE ONLY" data steward of the data.  "THE" data steward may have a high level of accountability for the management of the data but they often are positioned at a level that is too high to participate in the daily responsibilities for the data.  For that reason, the Subject Matter Manager's should be responsible for identifying (not assigning) the hands-on participating stewards and also the daily coordination of the Participating Data Steward activities.

    Subject Area Manager Responsibilities:

    • Identifying (with the Program Manager) and coordinating the activities of the appropriate Participating Data Stewards including Data Definers, Data Producers and Data Users of data that falls under the subject matter.

    • Ensuring the Data change management procedures are defined, followed and enforced to assure that the program is current/active and that the stewardship information in the repository is kept up-to-date.  This may involve periodic reviews of the procedures and enforcement of these

    Data Definition Stewards (Definers)

    The Data Definition Steward is primarily a business role however there will be opportunities for technology-focused individuals in the areas of IT infrastructure asset management areas to put business definition to technology-based data.

    Data Definition Steward Responsibilities:

    • Identifying the specific data that is needed to operate the business processes of their areas.

    • Recording business definition and appropriate meta-data (information about the data such as business name, business description, valid values, etc.) for the data they define by utilizing the tools and processes that have been identified and that are supported by the Data Stewardship Program Manager's resources (above the line).

    • Identifying opportunities to share and re-use data. 

    • Identifying data quality standards. 

    • Participating in the enforcement of data quality standards and identifying and resolving to data quality issues pertaining to the data that they define.

    • Ensuring the the quality, completeness and accuracy of data definition.

    • Identifying the specific data that is needed to operate the business processes of their business areas.

    • Recording business definition and appropriate meta-data (information about the data such as business name, business description, valid values, etc.) for the data they define by utilizing the tools and processes that have been identified and that are supported by the IT organization.

    • Identifying opportunities to share and re-use data. 

    • Identifying data quality standards. 

    • Participating in the enforcement of data quality standards and identifying and resolving to data quality issues pertaining to the data that they define.

    • Ensuring the quality, completeness and accuracy of data definition.

    • Responsible for communicating new and changed business requirements to individuals who may be impacted

    • Responsible for communicating concerns, issues and problems with data to the individuals that can influence change

    More than one Data Definition Steward can exist per business area, business process, sub-process, data type or business data element.

    Data Production Stewards (Producers)

    The Data Production Steward can be both either a business and technical role although it is typically considered a technical role.  The Data Producer Steward will be considered synonymous with a businessperson who is responsible for entering and modifying data in the enterprise data bases and information systems.

    Data Production Steward Responsibilities:

    • Producing (inserting, updating, deleting) business and technical data in the IT systems that support the business processes.

    • Validating data that enters and exits business processes.

    • Coding and editing accurate data quality standards (including format, content, and data dependency) for the data they produce.

    • Ensuring the quality, completeness and accuracy of data production according to the definition of the data provided by the Data Definition Stewards.

    • Responsible for communicating new and changed business requirements to individuals that may be impacted

    • Responsible for communicating concerns, issues, and problems with data to individuals that can influence change

    More than one Data Producer Steward can exist per process, sub-process, data type or business element specifically when business and technical data and meta-data is produced and passed from an information system or business entity to the next. 

    Data Usage Stewards (Users)

    The Data Usage Steward is primarily a business role (depending on the source of the data) related to the usage of data.  Data Users potentially include constituencies outside of the organization who may or may not be included under the guidance of Data Governance. 

    Data Usage Steward Responsibilities:

    • Accessing and Using the data for its intended purpose.

    • Accessing information (meta-data) available about how the data was defined for the business and how the data was produced in the information systems in order to use the data for its intended purpose.

    • Ensuring the quality, completeness and accuracy of data usage according to the definition of the data provided by the Data Definition Stewards.

    • Responsible for communicating new and changed business requirements to individuals that may be impacted

    • Responsible for communicating concerns, issues, and problems with data to individuals that can influence change

    There are typically more Data Usage Stewards then there are data definers and data producers.

    The last two bullets in each of the Participating Data Steward Roles described above are identical and they are very important.  The Participating Data Stewards are the “eyes and ears” of the organization when it comes to knowing the data.  If a Participating Data Steward identifies a problem with the definition, production and/or usage of data, they must be responsible for bringing that issue to the attention of someone who can bring on change.

    Conclusion

    Flying in the face of my whole 3D approach to data stewardship (click here to see an old article I wrote called Data Stewardship in 3D) I am sad to say that many of the companies that I have worked with have initiated data stewardship programs by identifying a person (singular) as an “owner” or “THE” data steward of a particular piece or set of data.   Does this sound like you?

    My approach for defining data steward roles and responsibilities states that, more often than not, one participating steward per datum or data set is not effective or enough.  Especially if “THE” data steward is significantly far removed from the daily definition, production and usage of the data they are stewarding.  “THE” data steward may be formally accountable for the data – but this article suggests that companies should consider identifying the data definers, producers and users of the data and educate them on the roles and responsibilities defined here.  You will not be sorry you did.

    This short article has described a very practical, pragmatic and most importantly an actionable approach to defining data steward roles and responsibilities in your organization.  This article should help you to determine who the “TRUE” data stewards are.

    Once your organization has identified the data stewards the next question becomes – “what do we do next?” or “how do we get these stewards to truly participate in the data steward role?”  That is certainly a very good question and one that will be discussed in a future TDAN.com article.  If you need to know sooner, contact me at rseiner@kikconsulting.com and we can talk about it.

    © Copyright 2005 - Robert S. Seiner
      


    Robert (Bob) S. Seiner is recognized as the publisher of The Data Administration Newsletter, LLC – www.TDAN.com - an award winning electronic publication that focuses on sharing information about data, information, content & knowledge management disciplines. Mr. Seiner speaks often at major data management and meta-data management, business intelligence and knowledge management related conferences and user group meetings across the U.S. He can be reached at the newsletter at rseiner@tdan.com or 412-220-9643.

    Mr. Seiner is the President and Principal Consultant of KIK Consulting & Educational Services, LLC – www.KIKconsulting.com - a company that focuses on consultative mentoring, a blend of consulting and educational services to enable companies to improve and gain value from the management of their data and information resources. Mr. Seiner offers 2-day in-house and public courses on how to build and implement data governance / stewardship programs and meta-data programs.  He can be reached at the consulting firm at rseiner@kikconsulting.com.

     

  •  
    by William Laurent

    Summary: Having first-class data significantly helps all aspects of your organization's business - helping to effectively gauge and manage risk, avoid redundant work loads, increase customer satisfaction and provide better business intelligence and decision-making support.

    The Sarbanes-Oxley Act of 2002 (particularly Section 404), has created a sense of urgency for senior executives to take an active interest in the accuracy, consistency and timeliness of their data. Now more than ever, corporate compliance, audit and governance issues must be efficiently resolved. Yet it is widely believed that less than half of large companies have a formal data stewardship or data quality program that protects and leverages their unique strategic asset of data. Smart businesses are proactive, not reactive; they comprehend the business need for quality data. Aside from compliance and governance agendas, having first-class data (and the ability to unlock it) significantly helps all aspects of your organization's business - helping to effectively gauge and manage risk, avoid redundant work loads, increase customer satisfaction (on the front and back ends) and provide for better business intelligence and decision making support. Conversely, poor data quality will have injurious consequences across all enterprise operations - negatively impacting the most critical customer service obligations (such as billing and remittance processing), skewing decisions and fulfillment obligations, and creating a host of other tactical problems. Numerous hidden business costs such as revenue leakage and higher capital expenditures may all be maladies that stem from lack of data integrity. While there are tools that can help with and augment your formal methodology, capable data stewardship requires the symbiotic merging and integration of the automated (technology) with the manual (people). For many organizations, data problems remain secreted or hidden until issues are uncovered that result from a formidable event of costly business impact, such as an external audit, fraudulent activity, subpoena or valued customer satisfaction issue. A prolific variety of complex legacy data (which may go through many iterations of manual reconciliation on its way up to senior management's eyes) lingering in cloaked and "siloed" structures will always be poised to wreak havoc when least expected. Today, with the advent of widespread Internet and e-commerce applications, the data problems of an enterprise can be exposed to the entire consumer world. If you have bad data on your customers (e.g., not recognizing them across accounts), they will know - often long before you catch on. The time to establish a formal data stewardship program is now!

    When implementing a formal stewardship program or policy for your organization, it will be important to appoint the right person as the lead steward and assign the steward an appropriate title. This person will understand and approach duties from the perspective that data quality is a collaborative business and IT matter as he/she champions high-quality data across all multinational systems - from repositories to reports. The most effective stewards will be familiar with core business values and practices but should also be able to understand data models, tech-speak and data storage topics from a high and low level. (A systems analyst background with relevant industry understanding and outstanding communication skills will be ideally suited to bridge this sometimes giant gap between technology and the business.) A strong leader and a people person, the lead steward will educate and expand many people's horizons about proper data governance and the consequences of unreliable data on business objectives; the steward will make others accountable for continuous improvements in the caliber of data. Lead stewards should be visible senior-level people who are respected and well liked in the organization, with the ability to motivate and envision change from a high level. They should be empowered by senior management and steering committees to directly address issues and manage standards-based implementations from both a business and technology-centric view, brandishing their "data police badge" when staff members resist data standards and the added responsibility or loss of control that come with such regulations. A VP-level title of data steward is not out of the question.

    Stewards must have specific and measurable goals for data quality, making sure that public data helps enforce and promulgate vital business rules and processes. A viable formal stewardship policy will be rooted in ongoing standards that identify goals, priorities and quality metrics within all systems infrastructure elements (from data warehouses to OLTP applications) and business functions that touch or affect data. The steward will tie business strategy to data strategy, applying generally accepted qualitative metrics and heuristics (risk management, cost benefit analysis, change management, etc.) to the measuring and enforcement of data quality. With increasingly aggressive life cycle timelines and rapid application development (RAD) paradigms, organizations should not have to keep worrying about the veracity of data from conversion to conversion, re-inventorying and re-reconciling data elements piecemeal every time a new systems integration project is tackled. Without ongoing business driven tools and touchstones for data assessment and improvement, your data stewardship program is doomed to fail. Only a continuous and structured methodology will bring long-term benefits, not a sustained culture of quick data fixes. For larger enterprises, this methodology will have functions that leverage time-honored calculations, derivations curves and benchmarks for measuring quality. The business rules of the corporation should be documented and maintained alongside data models, data dictionaries, meta data repositories, etc., so that a spirit of common responsibility, knowledge and ownership is cultivated. The formal stewardship policy will reside in a public place (corporate intranet) along with other operating policies so that employees who create, use and own the data develop a sensitivity to issues of data rectitude.

    If data quality improvements are to continue to be the norm, the impetus must come both through technology architecture and business requirements. It must be understood and evangelized throughout the company that many times business continuity is data continuity: data problems invariably turn into business ones, increasing a firm's exposure to various unanticipated risks. In contrast to purely technical issues, a data steward will sometimes have to address and analyze data problems that are attributable to business routines that have a large human/manual element, and thus cannot be changed easily. The steward will need to have a business understanding and the wherewithal (support of senior management) to carry out the reengineering of various business processes. This may necessitate that they have a cursory understanding of various types of unstructured data (such as that found in electronic document management systems) and data that is not stored digitally, as well as the integration challenges associated with them. In such cases, a steward may have to help engineer a balance between the manual processes - routing paper requests, bar-coding, manual archiving - of a company's information center and records management databases and software.

    Under a stewardship program, everybody shares responsibility and accountability for data excellence; information belongs to everybody, and like other (more empirical) corporate resources, it must be scrupulously managed and cared for. The stewards do not own the data; they define and bestow ownership and accountability accordingly, as they train, guide and mentor others in data quality best practices, rewarding compliance and adherence to quality specifications. Everybody that touches data throughout the organization must understand their role in data quality and be able to provide a feedback loop that will help stop bad data habits from propagating throughout the business. There will be pockets of resistance. However, a good lead data steward must know how to deal with the political side of the job. Drawing up a data consumer matrix, which describes who uses what data elements (and what actions are taken with that data), is always an invaluable exercise for assigning data trusteeship. This will help create a culture of data stewardship along all parts of the information life cycle and data supply chain - from the most important reference data to the most transitory of meta data.

    In order to accomplish a firm foundation for continuous and measurable integrity of data, stewards should create and foster various partnerships and cross-functional teams (from business and technology) based on pre-approved agreements that govern systems not only inside, but also outside the aegis of enterprise control. Studies have shown that employee data entry errors are responsible for more than 75 percent of bad quality data. However, due to intensified e-commerce activity and broadening reliance on third-party data feeds, potential sources for substandard data are rising exponentially for many businesses. External third-party data sources that will commingle with (or feed) your internal system schemas should meet predefined standards of information quality and management before reaching the sanctity of your organization's gateway. Data suppliers must be held to a service level agreement (SLA) that states their responsibility and interest in providing quality data from a value-added information environment.

    There is no one-size-fits-all job description for a data steward; stewardship roles will evolve according to the topology of IT infrastructure/architecture and the principal nature of the business. Care should be taken so that the steward is not saddled with more responsibility than he/she can handle. It will sometimes be good to have a hierarchy of steward roles or spheres of influence, where co-stewards oversee data probity issues along predefined systems or business segment domains in order to make data policy more manageable. Another possibility is to create a data stewardship committee that will share general responsibility for defining quality metrics; assessing acceptable and permissible use patterns of information assets; and assigning levels of stewardship responsibilities accordingly in order to limit data exposure and integration risks across all platforms. The committee will meet periodically to discuss all data governance issues, reporting on gaps in quality control and releasing/publicizing their findings. Items for committee consideration may include the following and beyond:

    • Maintenance and storage strategies for corporate reference data.
    • Comprehensive standards for data dictionaries, data models, calculation engines, business rules, ETL deltas and more.
    • XML strategy: While XML and associated namespaces have been a boon for system-to­system connectivity and communication exchange, standardizing the syntax, format and relationships of information objects -- a semantic consistency that reconciles business data and business-speak ­ will not always be easily achieved. People must know what XML tags mean in the real world business sense, across all of an enterprise's data structures, whether they are financial, manufacturing, etc. The stewardship committee will organize and establish correct linkages for all vital XML elements, artifacts and attributes via a meta data registry (MDR) that will document and define (i.e., capture meanings about) all XML namespaces by storing descriptions of data units and values across systems and data contexts.
    • Data privacy and data security policies.
    • Web-enabled universal "data shopping" on corporate intranet.
    • Auditing of service level agreements to ensure proper data availability.
    • Data standards: documented data domains, validations rules, data dependencies; source to target mappings; semantic management, naming conventions, data-typing, etc.
    • Replication, backup and recovery policies and procedures.

    With the relentless push toward federated-access systems that offer complex real-time (or near real-time) distribution and availability/exposure of data services, data volumes and information proliferation have grown to historically high levels. For many, data integrity problems have become worse than ever, establishing huge roadblocks for the timely delivery and sharing of quality data. An unceasing program of data governance and management, established by carefully selected people, roles, responsibilities and technologies, may be long overdue. Address your issues early (now!), not when you are ready to expose data to share. Address your enterprise-wide data quality issues with respect to accessibility, accuracy, consistency and completeness. Get executive buy-in and start your data stewardship program today! 




    William Laurent is practice director for Business Intelligence and Corporate Governance at Global Passage LLC. Laurent has a diverse systems background, successfully designing and managing implementation of projects for the insurance, banking, finance, government, technology and entertainment industries.  He would enjoy receiving your comments at wlaurent@gpassllc.com.

    Copyright 2007, SourceMedia and DM Review.

  • RM 101

    Why Records Management?

    by Priscilla Emery
    30-May-2005 --<script></script> Comments -- none yet<script src="http://www.intensedebate.com/js/getCommentLink.php?acct=4f391d025c7a274f56ab1bba3cf65df1&postid=http://www.cmswatch.com/Feature/127-RM-101&posturl=http://www.cmswatch.com/Feature/127-RM-101&posttitle=" type="text/javascript"></script><script src="http://www.intensedebate.com/js/genericLinkWrapperV2.js" type="text/javascript"></script>

     

    Ed.'s Note: This article is excerpted from the Records Management Report, published by CMS Watch.

    To some, managing records represents one of the most boring and onerous business functions that anyone could possibly undertake within an organization. Of course, most people don’t even understand what records management is -- making it easy to malign an activity that is so misunderstood.

    Indeed records management crosses numerous disciplines. Did you know…

    • That categorization and indexing are two elements that are critical to the success of a records management program? Sounds a little like knowledge management.
    • That vital records preservation is one of the key steps in developing a disaster recovery plan? Sounds like infrastructure management.
    • That by ignoring records management policies employees and their companies can potentially end up facing criminal penalties? Sounds like a legal profession.
    • What led to Arthur Andersen’s downfall? Shredding — inappropriate shredding — shredding of records that should have been retained according to the policies of both Andersen and Enron.

     

    Despite what people may say, what you don’t know can hurt you and, in the case of records management, what you choose to ignore can cripple you and your organization.

    Recent dramatic headlines have made it quite apparent that records management (or the lack thereof) is an essential activity to ascertain and confirm the credibility of many business transactions and government activities. The proliferation of electronic documents (especially e-mails) and the potential litigation exposure that they cause are becoming the bane of legal advisors and records managers in many corporations and government agencies. Microsoft, Texaco and other Fortune 500 companies have taken hits from “runaway” e-mails introduced as evidence in high stakes cases.

    But, to records managers e-mail is just another record type in the scheme of things (not a trivial record type but a record type just the same). They know that in fact, “smoking guns” can come in many different guises, including: paper, electronic image files, video tape, voice recordings, etc. It’s all potential evidence and therefore potential deposition fodder. Making sure that all the relevant information is accessible in a timely fashion, should an organization need to defend itself in a lawsuit, is also what records managers are responsible for.

    These days a records manager’s job is anything but boring, and in many organizations there aren’t enough of them to handle the increased compliance laws and regulations that have cropped up in recent months.

    Not surprisingly, then, many companies are looking at records management software as a way to get a better grip on the increased volume of all kinds of records. Although that approach can help, a basic understanding of what records management is and its concomitant practices are required before even looking at any software product. If you don’t have a records management policy in place along with the appropriate retention and destruction rules for the records, the software will be useless.

    What is Records Management?

    There are many different definitions of records management but one that I like the best is,

    “A professional discipline that is primarily concerned with the management of document-based information systems. The application of systematic and scientific controls to recorded information required in the operation of an organization’s business. The systematic control of all organizational records during the various stages of their life cycle: from their creation or receipt, through their processing, distribution, maintenance and use, to their ultimate disposition. The purpose of records management is to promote economies and efficiencies in recordkeeping, to assure that useless records are systematically destroyed while valuable information is protected and maintained in a manner that facilitates its access and use.”

    Some people have the mistaken impression that records management is about hoarding everything that comes across one’s desk in the course of doing business. In some highly regulated industries it may seem that is the case. But in most cases it’s not only making sure that what needs to be kept as a record is retained but also prescribing how long it should be kept, where it should be stored, who has access to it and when it should be destroyed (if ever).

    There is even an ISO standard for developing a records management program (ISO 15489). However, even though records retention practices are considered systematic and scientific, deciding what is a record and how long it should be retained is a combination of both prescribed practice and a certain amount of subjectivity depending on the actual documents in question. Just defining what is a record can get somewhat confusing to people.

    What is a Record?

    According to the Federal Records Act a record is, “recorded information, regardless of medium or characteristics, made or received by an organization that is evidence of its operations and has value requiring its retention for a specific period of time.”

    According to the National Archives and Records Administration (NARA) records include,

    “… all books, papers, maps, photographs, machine-readable materials, or other documentary materials, regardless of physical form or characteristics, made or received by an agency of the U. S. Government under Federal law or in connection with the transaction of public business and preserved or appropriate for preservation by that agency or its legitimate successor as evidence of the organization, functions, policies, decisions, procedures, operations, or other activities of the Government or because of the informational value of the data in them.”

     

    Got it?

    This can be confusing for non-governmental organizations that don’t believe that these definitions necessarily apply to them so a more generic definition would be that records are recorded information, regardless of physical form, that are generated or received and used while conducting business, and preserved because of their informational value or as evidence of an organization’s functions, policies, decisions, procedures, operations, mission, programs, projects, and activities.

    This is why there is no real cookie-cutter approach to defining a records retention program for every document out there — every company has to look at its own business operations and examine what should be considered a record and then decide what retention rules to apply against them. Again, the government and other industry regulators may help in that decision by defining exactly what they expect to see if they have to do an audit or investigation, but audits and investigations aren’t the only reasons to have a records management program.

    There are other ways to determine if an information item should be considered a record or not:

    • Does the information document your daily business process?
    • Does the information provide input to a mission-critical business decision?
    • Does the information provide evidence as to whcy a business decision was made?
    • Is the information required for legal, fiscal, audit, or tax purposes?

     

    "Vital Records" contain unique or irreplaceable information and require special protection, such as articles of incorporation, annual reports and shareholder records. These should be defined as an integral part of a disaster recovery plan or operation.

    RM Programs, RM Software

    Essential elements of a records management program include a records retention policy and a set of procedures where records are classified, retention periods are defined and destruction procedures are prescribed. Classification of records and then maintaining them with the appropriate metadata are necessary so that this information can be retrieved quickly when required.

    Record integrity depends on three attributes: content, context, and the structure of the original record. Authenticity is very important to determining if a piece of information is the “true” record of an event or business transaction, which is why the management of electronic records, most especially e-mails, web pages and instant messages, is so problematic. The ability to generate larger volumes of records in a shorter period of time has increased the productivity of many organizations but that productivity gain can be lost if essential records can’t be found during the course of doing business or, in even more critical procedures such as when an audit takes place or a legal discovery of evidence is required. Finding lost records costs money in terms of lost productivity and in more the substantial costs of fines, court costs and lawsuit settlement costs.

    Hence the need for records management software that aid in the record repository management activities of many organizations. These products help users:

    • manage retention schedules,
    • store records in their appropriate classifications along with the prescribed metadata,
    • search for records when required,
    • destroy records according to prescribed schedules,
    • put holds on records so they don’t get destroyed while the company goes through an audit or investigation, and
    • track the access of the record over its active and in-active life cycle.

     

    In many cases these systems track not only electronic documents but also other record objects such as paper, boxes and tapes...

    Amid all the focus on key records management features such as record declaration and standards certification, one key factor sometimes gets lost in the shuffle. Records management is a core activity within many different business processes within the organization itself. It is in and of itself not meant to be the core application. Just like filing is an adjunct activity to one's work, records management must fit into the core business process in order to be truly effective.

    Therefore, technological elegance shouldn't be the focus -- integration with core business processes is more important. Ability to support and integrate with key business applications and be imbedded as an activity within a workflow should become more important evaluation criteria in the long run...

     

    <script></script>

    Comments

    Logging you in...
    Close Login with your OpenID
    • Logged in as
    There are no comments posted yet. Be the first one!

    Post a new comment

    Or post using OpenID
    Note: CMS Watch moderates all comments. Our policies:
    • No anonymity - you must indicate your name and affiliation in the text of your comment
    • Disagreements welcome, but no marketing by vendors or other firms, and please mind your manners

    <script src="http://www.intensedebate.com/comment/4f391d025c7a274f56ab1bba3cf65df1/generic/11345849" type="text/javascript"></script><script src="http://www.intensedebate.com/js/genericCommentWrapperV2.js" type="text/javascript"></script>

     


     

    Next:

    Send Feedback

     

    See all ECM Suites Channel feature articles.

    Need to select a technology vendor, but confused about your choices? See our vendor-neutral technology reports.

    Join the conversation

    Digg This! Search Technorati Tag it on Del.icio.us


    About the Author

    Priscilla Emery

    Priscilla Emery is President and founder of e-Nterprise Advisors and has provided market research and analysis services to Fortune 1000 and high technology companies. With more than 30 years of technology experience, her areas of expertise include imaging, document, records and e-mail management, and other technologies in the enterprise content management (ECM) landscape.


     

     

    Get a Free Sample

    Wondering about CMS Watch research? Sign up to receive free samples of any of our products.


     

    What we do

    CMS Watch™ evaluates content-oriented technologies, publishing head-to-head comparative reviews of leading solutions. What makes us special?

    • Our critical analysis exposes product weaknesses as well as strengths
    • We deliver unrivaled technical depth and comprehensive project advice
    • Our research is led by international topic experts
    • We only work for buyers -- never for vendors

     

    Contact us

    CMS Watch

    info@cmswatch.com

    3470 Olney-Laytonsville Road Suite 131

    Olney, MD USA 20832

    1 800 325 6190 (customer service)

    +1 617 763 5336 (int'l customer service)

    Fax: +1 214 242 3048




  • Apr 03, 09

    Jesse Wilkin's personal blog. Expertise in the areas of document imaging, ECM, and software development and testing. More recently electronic records management, email management, and especially Web/Enterprise/Buzzword 2.0

    • In fact, it is sometimes a necessity. Experts say that in court, the burden is  on a policyholder to prove that a policy ever existed.<!-- Begin Floating Content -->
    • Policy retention still is an issue for many businesses, and electronic  technology has made large-scale document storage easier. However, the advent of  claims-made policies has reduced the value of expired liability policies.

    1 more annotation...

    • Google(Google reviews) announced today that CVS/pharmacy, one of the  largest pharmacy chains in the U.S., has partnered with Google Health to provide  patients online access to their prescription drug history through Google Health  accounts. This is in addition to Walgreens Pharmacy, Meijer, Medco, and other  national pharmacies.
    • For those who have not used Google Health, it  currently allows you to import your medical records from over a dozen  pharmacies, medical centers, and health insurance providers. Once imported, you  can review those records and keep them updated, with the help of Google and its  partners.
  • Apr 21, 09

    Remarks by the Deputy Governor of the British Virgin Islands on the importance of Records Managment. They set aside a day in April to recognize
    RIM. Annotated link http://www.diigo.com/bookmark/http%3A%2F%2Fwww.sknvibes.com%2FNews%2FNewsDetails.cfm%2F914

    • For the past five years, the Archives and Records Management Unit  within the Deputy Governor’s Office has brought attention to the importance of  Records and Information Management by setting aside a special day in its  calendar which coincided with annual Mayday celebrations. 
    • The question could be asked, ‘Why has Records and Information Management become  so important in the global business environment?’  Simply put, businesses  often operate in complex legal and regulatory environments and the penalties for  non-compliance are increasingly unforgiving.  Additionally, with  information continuously on the increase both in electronic and paper format, it  is important that that information is captured, well managed and properly  preserved. Hence the need for a good Records and Information Management  programme which could realize important benefits for organizations. Ensuring  that information is retrieved in a timely manner, saving space and reducing  storage costs, supporting decision-making, preserving corporate memory and  ensuring regulatory and legal compliance, are only a few of such benefits.
  • 1 - 20 of 32 Next ›
    20 items/page
    List Comments (0)