Text Editing Encoding Analysis

From DARC (Digital Archive Research Collective)
Jump to navigation Jump to search

The following is a list of resources for projects featuring text editing, encoding, and analysis. These markup languages, text editors, and text mining methods allow researchers to clean, sort, process, and display textual data.

  • XML, or eXtensible Markup Language, is a language for text encoding, or marking up text to make it both human and machine readable.
  • TEI, or Text Encoding Initiative, is an extension of XML for encoding textual data (such as manuscripts) in humanities, social sciences and linguistics contexts.
  • XSLT, or eXtensible Stylesheet Language Transformations, is a language for transforming XML documents into other formats, such as HTML for web publishing.
  • Text Analysis allows researchers to extract information about texts by processing them with computational tools such as Python and MALLET.
  • Text Editors such as VSCode and Atom allow researchers to create, clean, and manage text, essential for any digital project that involves working with plain text, markup, or programming languages.