LDC collects and distributes speech and text databases, lexicons, and other resources for linguistics research and development purposes. These resources are essential for various fields such as corpus linguistics, machine translation, natural language processing, and speech technology. The Linguistic Data Consortium (LDC) is an open consortium of universities, companies, and government research laboratories. It was founded in 1992 and is hosted by the University of Pennsylvania.