Enterprise content management
Enterprise content management extends the concept of content management by adding a timeline for each content item and, possibly, enforcing processes for its creation, approval and distribution. Systems using ECM generally provide a secure repository for managed items, analog or digital. They also include one methods for importing content to bring manage new items, and several presentation methods to make items available for use. Although ECM content may be protected by digital rights management, it is not required. ECM is distinguished from general content management by its cognizance of the processes and procedures of the enterprise for which it is created.
Definitions
- Late 2005: The technology used to capture, manage, store, preserve, and deliver content and documents related to organizational processes
- Early 2006: ECM tools and strategies allow the management of an organization's unstructured information, wherever that information exists.
- Early 2008: The strategies, methods, and tools used to capture, manage, store, preserve, and deliver content and documents related to organizational processes. ECM tools and strategies allow the management of an organization's unstructured information, wherever that information exists
- Early 2010: The strategies, methods, and tools used to capture, manage, store, preserve, and deliver content and documents related to organizational processes. ECM covers the management of information within the entire scope of an enterprise whether that information is in the form of a paper document, an electronic file, a database print stream, or even an email
- March 2017: The Association for Information and Image Management proposed replacing "enterprise content management" with "intelligent information management". IIM is defined as "the strategies, methods, and tools used to create, capture, automate, deliver, secure, and analyze content and documents related to organizational processes. IIM refers to the management of content AND data, not just content itself."
ECM, as an umbrella term, covers document and web content management, search, collaboration, records management, digital asset management, workflow management, and capture and scanning. It manages the life cycle of information, from initial publication through archival and eventual disposal. It is delivered in four ways:
- On-premises software
- Software as a service : Web access to information stored on a software manufacturer's system
- A hybrid of both on-premises and SaaS components
- Infrastructure as a Service : Online services which abstract the user from infrastructure details like physical computing resources, location, data partitioning, scaling, security, and backup
Scope
Enterprise content management, a form of content management, combines the capture, search and networking of documents with digital archiving, document management and workflow. It includes the challenges involved in using and preserving a company's internal information in all of its forms. Most ECM solutions focus on business-to-employee systems.New ECM components have emerged. As content is checked in and out, each use generates new metadata. Information about how people use the content can allow the system to acquire new filtering, routing and search pathways, corporate taxonomies and semantic networks, and retention-rule decisions.
Solutions can provide intranet services to employees, and can include enterprise portals for business-to-business, business-to-government, government-to-business, or other business relationships. This category includes most former document-management groupware and workflow solutions that had not, by 2016, fully converted their architecture to ECM but provided a web interface. Digital asset management is a form of ECM involving digitally-stored content.
History
The technologies which encompassed ECM in 2016 descend from the electronic document management systems of the late 1980s and early 1990s. The original EDMS products were stand-alone, providing functionality in one of four areas: imaging, workflow, document management, and enterprise relationship management.A typical early EDMS user had a small-scale imaging and workflow system to improve a paper-intensive process and work towards a paperless office. The first stand-alone EDMS technologies intended to save time by reducing paper handling and storage, reducing document loss and speeding access to information. EDMS could provide online access to information formerly available only on paper, microfilm, or microfiche. By improving control over documents and their processes, EDMS streamlined business practices. Their audit trail increased document security and measured productivity and efficiency.
EDMS product categories were seen as complementary, and organizations wished to use several EDMS products. A customer-service department could combine imaging, document management and workflow; an accounting department could access supplier invoices from an ERM system, purchase orders from an imaging system, and contracts from a document-management system. As organizations established an Internet presence, they wanted to manage web content. Organizations which had automated individual departments began to envision a broader deployment.
The movement toward integrated EDM systems reflected a common trend in the software industry: the integration of small systems into more comprehensive ones. Word processing, spreadsheet, and presentation software were standalone products until the early 1990s, when the market shifted toward integration.
Early developers offered multiple stand-alone EDMS technologies as a single, packaged "suite", with little functional integration. Around 2001, the industry began to use the term "enterprise content management" for integrated systems.
In 2006, Microsoft and Oracle Corporation entered the low-cost ECM market. Open source ECM products are also available.
Government standards, including the Health Insurance Portability and Accountability Act, BS 7799 and ISO/IEC 27001, influence the development and use of ECM. In 2016, organizations could deploy a single ECM system to manage information in all departments.
Uses
Businesses adopt ECM to increase efficiency, improve information control, and reduce the overall cost of information management. ECM streamlines access to records with keyword and full-text searching, allowing employees to quickly obtain needed information from their desktops.The management systems can help businesses comply with government and industry regulations such as HIPAA, the Sarbanes–Oxley Act, the Payment Card Industry Data Security Standard, and the Federal Rules of Civil Procedure. Security at the user, function, and record levels protect sensitive data. Some information in a document can be redacted, so the remainder can be shared without compromising identity or key data. Every action in the system is tracked, and can be reported to demonstrate compliance with a wide variety of regulations.
Characteristics
In his Computerwoche article, Ulrich Kampffmeyer characterized ECM as:- Middleware, eliminating the restrictions of vertical applications and island architecture and transparent to users. ECM offers a third platform, in addition to conventional host and client-server systems. According to Kampffmeyer, enterprise application integration and service-oriented architecture will play important roles in ECM implementation.
- Independent services, managing information without regard to the source or the required use and available from a variety of applications. For a given use, only one general service is available; this avoids the expense and maintainance of parallel functions. Standards for interfaces connecting different services will play an important role in ECM implementation.
- A uniform repository for information, data and document warehouses combining company information. Information lifecycle management will also play an important role in the implementation of ECM.
Components
ECM combines components which can be used as stand-alone systems without being incorporated into an enterprise-wide system. The five ECM components were defined by the Association for Information and Image Management as:- Capture
- Manage
- Store
- Preserve
- Deliver
Capture
Earlier document automation systems photographed documents for storage on microfilm or microfiche. Image scanners make digital copies of paper documents. Documents already in digital form can be copied if they are available online. Automatic or semi-automatic capture can use electronic data interchange or XML documents, business and ERP applications, or specialized-application systems as sources.
Recognition technologies
Recognition technologies to extract information from scanned documents and digital faxes include:- Optical character recognition : Converts images of typeset text into alphanumeric characters
- Handwriting recognition : Converts images of handwritten text into alphanumerics
- Intelligent character recognition : Extends OCR and HWR to use comparison, logical connections, and checks against reference lists and existing data to improve recognition
- Optical mark recognition : Reads special markings in predefined fields
- Barcode recognition: Decodes industry-standard encodings of product and other commercial data
Image cleanup
Forms processing
Forms processing has two groups of technology, although the information content and character of the documents may be identical. It is the capture of printed forms via scanning; recognition technologies are often used, since well-designed forms enable automatic processing. Automatic processing can capture electronic forms if the layout, structure, logic, and contents are known to the capturing system.Enterprise report management
records reports and other documents on optical disks or other digital storage for ECM systems. The technology was originally used with laserDiscs.Data aggregation
unifies documents from different applications and sources, forwarding them to storage and processing systems in a uniform structure and format.Subject indexing
improves searches, providing alternative ways of organizing information. Manual indexing assigns index database attributes to content by hand, and is typically used by a "manage" database for administration and access.Automatic and manual attribute indexing can be facilitated with preset input-design profiles, which can describe document classes that limit the number of possible index values or automatically assign certain criteria. Automatic classification programs can extract index, category, and transfer data autonomously. Based on the information contained in electronic information objects, it can evaluate information based on predefined criteria or in a self-learning process.
Manage
The manage category has five application areas:- Document management
- Collaborative software
- Web content management, including web portals
- Records management
- Workflow and business process management
Document management
s control documents from creation to archiving. They include:- Check in-check out: Checks stored information for consistency
- Version management: Keeps track of different versions of the same information, with revisions and different formats
- Search and navigation: Finds information and its associated contexts
- Organizing documents in files, folders, and overviews
Collaboration
Collaboration components in an ECM system help users work together to develop and process content. Many of these components were developed from collaborative-software packages; ECM collaborative systems nclude elements of knowledge management.They use information databases and processing methods which are designed to be used simultaneously by a number of users on the same content item. Collaboration uses skill-based knowledge, resources and background data for joint information processing. Administration components and communications applications such as video conferencing may be included. Collaborative ECM may also integrate information from other applications.
Web content management
ECM integrates web content management systems, presenting existing information managed in the ECM repository.File and archive management
Unlike traditional electronic archival systems, file and archive management is the administration of records, important information, and data which companies are required to archive. Independent of storage media, managed information does not need to be stored electronically. File and archive management includes:- Visualization of file plans and other structured indexes for the orderly storage of information
- Unambiguous indexing of information, supported by thesauruses or controlled word lists
- Management of record-retention and deletion schedules
- Protection of information in accordance with its characteristics
- International, industry-specific, standardized metadata for the unambiguous identification and description of stored information
Workflow and business process management
- Visualization of process and organization structures
- Capture, administration, visualization, and delivery of information with its associated documents or data
- Incorporation of data-processing tools and documents, such as office products
- Parallel and sequential processing of procedures, including simultaneous saving
- Reminders, deadlines, delegation and other administrative functions
- Monitoring and documentation of process status, routing, and outcomes
- Tools to design and display process
Store
Store components temporarily store information which is not required, desired, or ready for long-term storage or preservation. Even if the store component uses media suitable for long-term archiving, it is still separate from "preserve."Store components may be divided into three categories:
- Repositories: storage locations
- Library services: administration components for repositories
- Storage technologies
Repositories
- File systems: Used primarily for temporary storage, as input and output caches
- Content management systems: Storage and repository systems for content; may be a database or a specialized storage system
- Databases administer information, and can also store documents, content, or media.
- Data warehouses: Complex storage systems based on databases, which provide information from a variety of sources. They may be designed with global functions, such as documents or information.
Library services
It manages online storage, nearline storage, and offline storage.
If the document management system does not provide it, the library service must have version management to control the status of information and check-in/check-out for controlled information provision. It generates an audit trail, logs of information usage and editing.
Storage technologies
A variety of technologies can be used to store information, depending on the application and system environment:- Magnetic online media: Hard disk drives may be locally attached, part of a storage area network, or mounted from another server.
- Magnetic tape: Magnetic tape data storage, in the form of tape libraries, use robotics to provide nearline storage. Standalone tape drives may be used for backup.
- Digital optical media: In addition to compact disc and DVD optical media, storage systems may use magneto-optical drives; optical jukeboxes can be used for nearline storage. Optical media in jukeboxes may be moved offline.
- Cloud computing: Data may be accessed via the Internet.
Preserve
Content eventually stops changing and becomes static. ECM's digital preservation components also temporarily store information which does not need to be archived. Preserve components have special viewers, conversion and migration tools, and long-term storage media:
- Write once read many or optical discs, CD-Rs and DVD-Rs
- Magnetic tapes in secure drives
- Content-addressable storage, with software protection against overwriting, erasure, and editing
- Storage networks may be used if they provide edit-proof auditing, with unchangeable storage and protection against manipulation and erasure.
- Microforms, such as microfilm, microfiche, and aperture cards, are typically used to secure electronic information.
- Paper also secures electronic information.
Long-term preservation strategies
Deliver
presents information from the manage, store, and preserve components. Its Association for Information and Image Management model for ECM is function-based; the deliver components may enter information into other systems or prepare information for storage and preservation.Deliver components may be divided into three groups: transformation technologies, security technologies, and distribution. Transformation and security are middleware services and should be equally available to all ECM components. For output, primary functions are layout and design and electronic publishing.
Transformation technologies
Transformations should be controlled and trackable by background services. They include:- Computer output to laser disc : When used for delivery COLD prepares output data for distribution and archiving. Applications include lists and formatted output, journals and logs.
- Personalization: Functions and output customized for a user's needs
- XML : Enables the standardized, cross-platform description of interfaces, structures, metadata, and documents
- PDF : A cross-platform printing and distribution format which permits content searches, the addition of metadata, and the embedding of electronic signatures. When generated from electronic data, PDFs are resolution-independent and allow crisp reproduction at any scale.
- Open XML Paper Specification : An XML specification, developed by Microsoft, describing the formats and rules for distributing, archiving, rendering, and processing XPS documents.
- Converters and viewers: Generates uniform formats to display information in different formats.
- Data compression: Reduces the storage space needed for pictorial information.
- Web syndication: Presents content in different formats, selections, and forms for multiple use in different forms for different purposes.
Security technologies
Digital rights management and watermarking are used in content syndication and media asset management to manage and secure intellectual property rights and copyrights. Electronic watermarks, embedded in a file, protect use rights for Internet content.
Distribution
ECM is provided to users with a variety of output and distribution media:- Internet
- * Extranets
- * Intranets
- * Electronic business portals
- * Employee portals
- Fax
- Data transfer in EDI, XML or other formats
- Mobile devices, such as mobile phones and personal digital assistants
- Data media, such as CDs and DVDs
- Digital television and other multimedia services
- Paper
Methods
On-premises
ECM was developed as a software application which companies implemented on corporate networks; each company manages and maintains the ECM and the network devices storing the data. On-premises ECM systems may be customized for organizational needs.Since paper-document capture requires devices such as image scanners or multi-function devices, it is typically performed on-premises; however, it may be outsourced to service bureaus for high-volume scanning, indexing and return via web transfer or on CDs, DVDs or other external storage devices.
Software as a service (SaaS)
ECM, also known as cloud computing, is user-accessible online on demand.Market development
Before 2003, the ECM market was dominated by medium-sized independent vendors which fell into two categories: those who originated as document-management companies and began adding the management of other business content, and those who started as web content management providers tried to branch out into managing business documents and rich media.In 2002, Documentum added collaboration capabilities with its acquisition of eRoom; Interwoven and Vignette countered with their acquisitions of iManage and Intraspect. Documentum purchased Bulldog for its digital asset management capabilities; Interwoven and OpenText countered with acquisitions of MediaBin and Artesia. OpenText also acquired the European software companies IXOS and Red Dot. In October 2003, EMC Corporation acquired Documentum. IBM purchased FileNet and Oracle purchased Stellent in 2006; OpenText also purchased Hummingbird Ltd. that year. Hewlett-Packard acquired the Australian company Tower Software in 2008. In March 2009, Autonomy purchased Interwoven; OpenText acquired Vignette in July of that year and MetaStorm in February 2011. OpenText acquired Global 360 in July 2011, and HP agreed to purchase Autonomy in August 2011.
In April 2007, CMS Watch principal Alan Pelz-Sharpe said: "Some of the biggest names in this business are undergoing substantial transformation that will lead to shifting road maps and product sets over the next few years".
Nuxeo and Alfresco offered open-source ECM software that year.
Gartner estimated in 2010 that the ECM market was worth approximately $3.5 billion in 2009; this was expected to grow at a compound annual growth rate of 10.1 percent through 2014. The market experienced a number of mergers and acquisitions in 2010. In 2014, Real Story Group added cloud-based vendors to its 2014 ECM evaluations.