UIMA (/juˈmə/ yoo-EE-mə),[1] short for Unstructured Information Management Architecture, is an OASIS standard[2] for content analytics, originally developed at IBM. It provides a component software architecture for the development, discovery, composition, and deployment of multi-modal analytics for the analysis of unstructured information and integration with search technologies.

Structure

edit

The UIMA architecture can be thought of in four dimensions:

  1. It specifies component interfaces in an analytics pipeline.
  2. It describes a set of design patterns.
  3. It suggests two data representations: an in-memory representation of annotations for high-performance analytics and an XML representation of annotations for integration with remote web services.
  4. It suggests development roles allowing tools to be used by users with diverse skills.

Implementations and uses

edit
Apache UIMA
DevelopersIBM, Apache Software Foundation (since October 2006)
Stable release
3.1.1 / November 8, 2019; 6 years ago (2019-11-08)[3]
Written inJava with C++ enablement
Operating systemcross-platform
Typetext mining, information extraction
LicenseApache License 2.0
Websiteuima.apache.org
Repository

Apache UIMA, a reference implementation of UIMA, is maintained by the Apache Software Foundation.

UIMA is used in a number of software projects:

  • IBM Research's Watson uses UIMA for analyzing unstructured data.[4]
  • The Clinical Text Analysis and Knowledge Extraction System (Apache cTAKES) is a UIMA-based system for information extraction from medical records.
  • DKPro Core is a collection of reusable UIMA components for general-purpose natural language processing.

See also

edit

References

edit
  1. ^ UIMA Frequently Asked Questions (FAQ's) Archived 2013-05-07 at the Wayback Machine The Apache Software Foundation
  2. ^ UIMA Specification The Apache Software Foundation.
  3. ^ "Apache UIMA - News". uima.apache.org. Retrieved 11 December 2019.
  4. ^ "Apache Innovation Bolsters IBM's "Smartest Machine on Earth" in First-ever Man vs. Machine Competition on Jeopardy! Quiz Show : The Apache Software Foundation Blog". blogs.apache.org. 14 February 2011. Retrieved 23 April 2018.
edit

📚 Artikel Terkait di Wikipedia

Apache cTAKES

developing statistical NLP components in Java. It is built on top of Apache UIMA. NegEx - is a tool developed at the University of Pittsburgh to detect

Apache Commons

The Apache Commons is a project of the Apache Software Foundation, formerly under the Jakarta Project. The purpose of the Commons is to provide reusable

Apache OpenNLP

Architecture (UIMA) General Architecture for Text Engineering (GATE) cTAKES "Apache OpenNLP 2.5.3 released". Retrieved 10 January 2025. Apache OpenNLP Website

Java (programming language)

features, offering an implementation compatible with the standard library (Apache Harmony). The use of Java-related technology in Android led to a legal dispute

List of Apache Software Foundation projects

This list of Apache Software Foundation projects includes the software development initiatives maintained by the Apache Software Foundation (ASF). In addition

IBM Watson

researchers. [citation needed] Watson uses IBM's DeepQA software and the Apache UIMA (Unstructured Information Management Architecture) framework implementation

Outline of the Java programming language

Java Edition NetBeans Apache Software Foundation – Apache Commons, Apache Maven, Apache Tomcat, Apache Kafka Eclipse Foundation – Adoptium, Eclipse IDE

List of JVM languages

Solr Apache Spark Apache Xerces Apache Storm Apache Tika Apache Tomcat Apache Uima Apache Velocity Apache Xerces Apache Zookeeper BioJava BlueCove Bouncy