Enterprise content management
Encyclopedia : E : EN : ENT : Enterprise content management
Enterprise content management (ECM) is a widely-recognized information technology-industry term for software technology that enables organizations to create/capture, manage/secure, store/retain/destroy, publish/distribute, search, personalize, and present/view/print digital content such as pictures/images, text, reports, video, audio, transactional data, catalog, code. ECM systems primarily focus on the capture, storage, retrieval, and dissemination of digital files for enterprise use and their life-cycle management.
ECM systems are generally tactical and non-discretionary expenditures, but they are increasingly being viewed as strategic core investments as organizations deal with accelerating business velocities, consolidation of redundant content management systems, exponential growth of content, and compliance issues (mandated or perceived). Moreover, these systems are becoming more infrastructure than application like.
- 1 Definition
- 2 Characteristics
- 3 Components of an enterprise content management system
- 3.1 Capture
- 3.1.1 Manually generated and captured information
- 3.1.2 Technologies for processing captured information
- 3.1.3 Document Imaging
- 3.1.4 Forms processing
- 3.1.5 COLD
- 3.1.6 Aggregation
- 3.1.7 Components for subject indexing of captured information
- 3.2 Manage
- 3.2.1 DM – Document Management
- 3.2.2 Collaboration (collaborative systems, groupware)
- 3.2.3 WCM – Web Content Management
- 3.2.4 RM - Records Management (file and archive management)
- 3.2.5 Wf - Workflow / BPM - Business Process Management
- 3.3 Store
- 3.4 Preserve
- 3.5 Deliver
- 4 Outlook
- 5 ECM market development
- 6 Literature and source of this article
- 7 See also
- 8 External links
Definition
The "official" definition of enterprise content management was created by AIIM international, the worldwide association for enterprise content management in the year 2000. The acronym ECM has been reinterpreted and redefined many times during the past years, replacing words like “create” or “customize” that were originally part of it.In autumn 2005 AIIM defines ECM as follows:
- Enterprise Content Management is the technologies used to Capture, Manage, Store, Preserve, and Deliver content and documents related to organizational processes.
Thus, the term enterprise content management refers to solutions that most often use Internet technologies, but concentrate on in-house information provision. The solutions tend to be enterprise portals for B2B as extranet and B2E as intranet. This category includes most of the former document management, groupware, and workflow vendors who have not yet fully converted their architecture, but simply put a web server in front of their applications. Enterprise content management follows a multilayered component approach that provides the necessary infrastructure for any application.
Characteristics
Content management has many facets including enterprise content management, Web content management (WCM), content syndication and digital or media asset management. Enterprise content management is a vision, a strategy, or even a new industry, but it is not a closed system solution or a distinct product. Therefore, along with DRT (Document Related Technologies) or DLM (Document Lifecycle Management), ECM can be considered as just one possible catch-all term for a wide range of technologies and vendors.A comparison of the definitions of the different application fields of ECM and WCM makes it clear that the existing system category distinctions cannot last long, whether for products and technical platforms or for usage models. Solutions that are used as pure in-house solutions today will be made accessible to partners or customers tomorrow. The content and structure of today’s outward-directed web portal will be the platform for tomorrow's internal information system. In his article in ComputerWoche (September 2001), Ulrich Kampffmeyer concentrated the claimed benefit of an enterprise content management system to three key ideas that distinguish such solutions from Web content management:
- "Enterprise Content Management as integrative middleware
- ECM is used to overcome the restrictions of former vertical applications and island architectures. The user is basically unaware of using an ECM solution. ECM offers the requisite infrastructure for the new world of web-based IT, which is establishing itself as a kind of third platform alongside conventional host and client/server systems. Therefore, EAI Enterprise Application Integration and SOA Service Oriented Architecture will play an important role in the implementation and use of ECM.
Components of an enterprise content management system
[de]Enterprise content management systems combine a wide variety of technologies and components, some of which can also be used as stand-alone systems without being incorporated into an enterprise-wide system.
These ECM components and technologies are categorized as:
- Capture,
- Manage,
- Store,
- Deliver, and long-term
- Preserve.
The traditional application areas are:
- Document management (DM),
- Collaboration (or collaborative software, groupware),
- Web content management (WCM) (including web portals),
- Records management (RM) (archive and filing management systems on long-term storage media) and
- Workflow / Business process management (BPM)
The individual categories and their components will be examined in the following.
Capture
[de]The “Capture” category contains functionalities and components for generating, capturing, preparing and processing analog and electronic information. There are several levels and technologies, from simple information capture to complex information preparation using automatic classification. Capture components are often also called “Input” components.
Manually generated and captured information
Manual capture can involve all forms of information, from paper documents to electronic office documents, e-mails, forms, multimedia objects, digitized speech and video, and microfilm.Automatic or semi-automatic capture can use EDI or XML documents, business and ERP applications or existing specialist application systems as sources.
Technologies for processing captured information
Various recognition technologies are used to process scanned documents and digital faxes, among them:- This converts image information into machine-readable characters. OCR is used for type.
Document Imaging
Document imaging processing techniques are used to show scanned images, and also allow legibility enhancement for capture. Functions like “despeckling,” which removes isolated pixels, or “adjustment,” which straightens images from sheets that feed in at an angle, improve the results of recognition technologies. Document imaging functions are used in capture quality control.Forms processing
In forms capture, there are two groups of technologies, although the information content and character of the documents may be identical.- Forms Processing
- Forms Processing means the capture of industrially or individually printed forms via scanning. Recognition technologies are often used here, since well-designed forms enable largely automatic processing.
COLD
COLD/ERM are technologies for the automatic processing of structured entry data. COLD stands for Computer Output to Laser Disk and is still in use although laser disks have not been on the market for years. The acronym ERM here stands for Enterprise Report Management. In both, supplied output data is processed based on existing structure information in such a way that it can be indexed independently of the origination system, and transferred to a storage component that can be dynamic (Store) or an archive (Preserve).Aggregation
Is a process of combining data entries from different creation, capture, and delivery applications. The goal is to combine and unify data from different sources, in order to pass them on to storage and processing systems with a uniform structure and format.Components for subject indexing of captured information
Systems incorporate further components for subject indexing and getting captured digital information to the appropriate recipients. These include:- Indexing (manual)
- In English parlance, indexing refers to the manual assignment of index attributes used in the database of a “manage” component for administration and access.
Manage
[de]The Manage components are for the management, processing, and use of information. They incorporate:
- Databases for administration and retrieval, and
- Access authorization systems
DM – Document Management
[de]Document management in this context does not refer to the industry known in Europe as DMS, but to document management systems in the narrower “classical” sense. These systems control documents from their creation through to long-term archiving. Document management includes functions like:
- Check in/Check out
- For checking stored information for consistency
Collaboration (collaborative systems, groupware)
[de]Collaboration actually simply means “working together”. However, these solutions, which developed from conventional groupware, now go much further and include elements of Knowledge Management. Collaboration includes the following functions:
- Jointly usable information databases
- Joint, simultaneous, controlled information processing
- Knowledge based on skills, resources and background data for joint information processing
- Administration components such as whiteboards for brainstorming, appointment scheduling, project management etc.
- Communication application such as video conferencing
- Integration of information from other applications in the context of joint information processing
WCM – Web Content Management
[de]Enterprise Content Management claims to integrate Web Content Management. However, information presented on the Internet and Extranet or on a portal should only be data that is already present in the company, whose delivery is controlled by access authorization and storage. Web Content Management includes the following functions, among others:
- Creation of new or editing of existing information in a controlled generation and publishing process
- Delivery and administration of information for the web presentation
- Automatic conversion for various display formats, personalized display and versions
- Secure separation of access to public and non-public information
- Visualization for Internet presentation (browser, HTML, XML etc.)
RM - Records Management (file and archive management)
[de]Unlike with traditional electronic archival systems, Records Management (RM; Electronic Records Management or ERM) refers to the pure administration of records, important information and data that companies are required to archive. Records Management is independent of storage media, and can also manage information stored otherwise than in electronic systems. Among the functions of Records Management are:
- Visualisation of file plans and other structured indexes for the orderly storage of information
- Unambiguous indexing of information, supported by thesauri or controlled wordlists
- Management of record retention schedules and deletion schedules
- Protection of information in accordance with its characteristics, sometimesdown to individual content components in documents
- Use of international, industry-specific or at least company-wide standardized meta-data for the unambiguous identification and description of stored information
Wf - Workflow / BPM - Business Process Management
[de]Workflow and Business Process Management differ substantially.
There are different types of Workflow, for example:
- “Production Workflow” which uses predefined sequences to guide and control processes
- “Ad-Hoc Workflow” in which the user determines the process sequence on the fly.
- “Workflow solutions” with autonomous clients which users mostly work with, or as
- “Workflow Engines” which act as a background service controlling the information and data flow, without requiring an own client for this.
- Visualisation of process and organization structures
- Capture, administration, visualization, and delivery of grouped information with its associated documents or data
- Incorporation of data processing tools (such as specific applications) and documents (such as office products)
- Parallel and sequential processing of procedures including simultaneous saving
- Reminders, deadlines, delegation and other administration functionalities
- Monitoring and documentation of process status, routing, and outcomes
- Tools for designing and displaying process
BPM or Business Process Management goes a step further than Workflow. Although the words are often used interchangeably. BPM aims at the complete integration of all affected applications within an enterprise, with monitoring of processes and assembling of all required information. Among BPM’s functions are:
- Complete workflow functionality
- Process and data monitoring at the server level
- EAI or Enterprise Application Integration, to link different applications
- BI or Business Intelligence, with rule structures, integration of information warehouses, and utilities that assist users in their work.
Store
[de]“Store” components are used for the temporary storage of information which it is not required or desired to archive. Even if it uses media that are suitable for long-term archiving, “Store” is still separate from “Preserve.”
The “Store” components listed by AIIM can be divided into three categories: “Repositories” as storage locations, “Library Services” as administration components for repositories, and storage “Technologies.” These infrastructure components are sometimes held at the operating system level like the file system, and also include security technologies which will be discussed farther below in the “Deliver” section. However, security technologies including access control are superordinated components of an ECM solution.
Repositories
Different kind of ECM repositories can be used in combination. Among the possible kinds are:- File systems are used primarily for temporary storage, as input and output caches. The goal of ECM is to reduce the data burden on the file system and make the information generally available through “Manage”, “Store” and “Preserve” technologies.
Library Services
Library Services have to do with libraries only in a metaphorical way. They are the administrative components close to the system that handle access to information. The Library Service is responsible for taking in and storing information from the Capture and Manage components. It also manages the storage locations in dynamic storage, the actual “Store”, and in the long-term “Preserve” archive. The storage location is determined only by the characteristics and classification of the information. The Library Service works in concert with the database of the “Manage” components. This serves the necessary functions of- Search, and
- Retrieval
- Online storage (direct access to data and documents)
- Nearline storage (data and documents on a medium that the drive can access, but for which robotics or something similar must first be set up)
- Offline storage (data and documents on a medium that is removed from system access).
- Version management to control the status of information
- Check-in/Check-out, for controlled information provision
Storage technologies
A wide variety of technologies can be used to store information, depending on the application and system environment:- Read and Write Magnetic Online Media
- This includes hard drives as RAID (Redundant Array of Independent Disks) server drive subsystems, Storage Area Networks (SANs) as storage infrastructures and Network-attached storage (NAS) as directly accessible network storage areas.
Preserve
[de]The “Preserve” components of ECM handle the long-term, safe storage and backup of static, unchanging information, as well as temporary storage of information that it is not desired or required to archive. This is sometimes called “electronic archiving,” but that has substantially broader functionality than that of “Preserve.” Electronic archiving systems today generally consist of a combination of administration software like Records Management, Imaging or Document Management, Library Services (IRS - Information Retrieval Systeme) and storage subsystems.
But it is not just electronic media that are suitable for long-term archiving. For purely securing information microfilm is still viable, and is now offered in hybrid systems with electronic media and database-supported access. The decisive factor for all long-term storage systems is the timely planning and regular performance of migrations, in order to keep information available in the changing technical landscape. This ongoing process is called Continuous Migration. The “Preserve” components contain special viewers, conversion and migration tools, and long term storage media:
Long term storage media
- WORM optical disk
- Write Once Read Many (WORM) rotating digital optical storage media, which include the classic 5 ¼" in or 3 ½" WORM disc in protective sleeve, as well as CD-R and DVD-R. Recording methods vary for these media, which are held in jukeboxes for online and automated nearline access.
Long term preservation strategies
To secure the long term availability of information different strategies are used for electronic archives.- Migration
- Continuous migration of applications, index data, meta data and objects from older systems to new ones generates a lot of work but secures the accessibility and usability of information, and allows during this process to delete no longer relevant information. Conversion technologies are used to update the formats of the stored information.
Deliver
[de]The “Deliver” components of ECM are used to present information from the “Manage”, “Store”, and “Preserve” components. They also contain functions used to enter information in systems (such as information transfer to media or generation of formatted output files) or for readying (for example converting or compressing) information for the “Store” and “Preserve” components. Since the AIIM component model is function-based and not to be regarded as an architecture, we can assign these and other components here. The functionality in the “Deliver” category is also known as “output” and summarized under the term “Output Management.”
The “Deliver” components comprise three groups of functions and media: Transformation Technologies, Security Technologies, and Distribution. Trans¬formation and Security as services belong on the middleware level and should be available to all ECM components equally. For Output two functions are of primary importance:
- Layout/Design
- With tools for layouting and formatting output, and
Transformation technologies
Transformations should always be controlled and trackable. This is done by background services which the end user generally does not see. Among the transformation technologies are:- COLD / ERM (Computer Output to Laser Disc)
- As distinct from “Capture” components, it prepares output data for distribution and transfer to the archive. Typical applications are lists and formatted output, for example individualized customer letters. These technologies also include journals and logs generated by the ECM components. Unlike most imaging media COLD records are indexed not in a database table but by absolute positions within the document itself (i.e. page 1 line 82, position 12). As a result COLD index fields are uneditable after submission unless they are converted into a standard database.
Security Technologies
Security technologies are cross-section functions that are available to all ECM components. For example, electronic signatures are used not only when documents are sent, but also in data capture via scanning, in order to document the completeness of the capture. PKI (Public/Private Key Infrastructure) is a basic technology for electronic signatures. It manages keys and certificates, and checks the authenticity of signatures. Other electronic signatures demonstrate the identity of the sender and the integrity of the sent data, i.e. that it is complete and unchanged. In Europe there are three forms of electronic signatures, of different quality and security: simple, advanced, and qualified. In most European states the qualified electronic signature is legally admissible in legal documents and contracts. Finally, there is Digital Rights Management and Watermarking. This is used in Content Syndication and in MAM (Media Asset Management) for managing and securing intellectual property rights and copyrights. It works with techniques like electronic watermarks that are integrated directly into the file, and seeks to protect usage rights and protect content that is published on the Internet.Distribution
All of the above technologies basically serve to provide the various contents of an ECM to target users by various routes, in a controlled and user-oriented manner. These can be active components such as e-mail, data media, memos, and passive publication on websites and portals where users can get the information themselves. Possible output and distribution media are:- Internet, extranet and intranet
- E-business portals
- E-mail and fax
- Data transfer by EDI, XML or other formats
- Mobile devices like mobile phones, PDAs, and others
- Data media like CDs and DVDs
- Digital TV and other multimedia services
- Paper
Outlook
The former member of the board of directors of AIIM international, Ulrich Kampffmeyer, states in his whitepaper on ECM in 2003:Document technologies like Enterprise Content Management make traditional data processing complete. They bring together structured, weakly structured, and unstructured information. Every company, every government agency, and every organization must confront the subject. Even if there are no immediate plans to implement such a system, it sneaks into the organization of its own accord – with the next server licence update, with the next office software suite, with the next database or ERP upgrade. In many companies with heterogeneous IT landscapes, the question of which redundant functionalities of existing products are unused is already more important than whether to invest in a new software system. The most important job is to keep in-house information under control. The questions add up: where to put the thousands and thousands of e-mails, what to do with the electronically signed business correspondence, where to put taxation-relevant data, how to transfer information from the disorganized file system, how to consolidate information in a repository that everybody can use, how to get a single login for all the systems, how to create a uniform in-basket for all incoming information, how to make sure that no information is lost or ignored, etc. etc. Document technologies play an important role in all these questions. ECM solutions are necessary basic components for many applications.
Every potential user will naturally consider his own individual needs before deciding on a system. However, putting off decisions does not make them less necessary. Every year something supposedly better and easier to use will come along, but waiting will just mean never installing anything. Every time the decision is put off, the mountain of uncontrolled and unused information gets bigger, and known problems get larger. A sensible long-term migration strategy removes the fear of fast technology change. The basic functions of document technology are mature, and most products are reliable, stable, secure, and increasingly affordable. In many industries, the use of document technology makes the difference in staying competitive. ECM - Enterprise Content Management – should be a part of every modern IT infrastructure.
ECM market development
Gartner, a leading industry analyst firm, estimates that by midyear 2006, 50 percent of ECM vendors will merge or be acquired (0.6 probability). According to Gartner, by 2008, 75 percent of Global 2000 companies will have a desktop-focused and a process-focused content management implementation (0.9 probability) and ECM will continue to absorb other technologies, such as digital asset management and e-mail management. Gartner also predicts that there will be further market consolidation, acquisition and separation of vendors into platform and solution providers[[Citing sources citation needed]].According to Gartner reports as of November 2005, the ECM market leaders are Open Text, EMC (Documentum), IBM, FileNet, Stellent and Hummingbird. The lone member of the challengers quadrant is Hyland Software. Other companies offering some form of ECM solutions include Claromentis, Xerox, Oracle Corporation, Objective Corporation, RedDot, TOWER Software, ASG-Cypress, Unicorn Enterprise System, Kofax, [Hershey Systems], Vignette, Interwoven, and [InfiNet Business Systems][[Citing sources citation needed]].
Open source developments
Plone, Nuxeo CPS, and Alfresco are open source ECM platforms.Literature and source of this article
- [ECM definition] by AIIM International
- [Original ECM article] by the German consulting company PROJECT CONSULT
See also
External links
- [AIIM] the international association for Enterprise Content Management
- [ECM vendors] list (in PDF format) with product classification covering Europe
- [CMS Watch] the ECM channel on CMS Watch.
- [CM Pros] the Content Management Community of Practice.
- [Better ECM Blog] Better ECM - Russ Stalters’ Blog on Exploring NextGen ECM
From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.
