Skip to Main Content

Digital Humanities

A detailed guide explaining what Digital Humanities is

Text Encoding and Markup


Text encoding and markup in DH is the process of creating structures and annotations to textual data (rather than numerical data) so that it is easily processed and analyzed by computers.

Text encoding ensures that non-western/Latin script can be represented in digital texts (such as Arabic, hieroglyphs, Zapotec, etc.), thus allowing for easy and equal accessibility for researchers around the world.

Text Markup uses tags or codes to define the format and the presentation of the text. You can think of text markup up as the ordering and adornment of the encoded text. 

Digital Text Collections

Digital textual collections and archives are sometimes (but not always) made available for free online. They can contain collections of books, manuscripts, newspapers, letters, etc. Some of their features include interactivity, navigability, and the option for reading in various formats (pdf, ebook, HTML text, etc.). The following are examples of free collections.