Illinois English professor Ted Underwood wants to know how the language describing male and female characters in works of fiction has changed since the late 18th century. He’s using data-mining tools to gather information from thousands of books to answer that question. The problem, though, is that books published after 1922 are still under copyright protection and their content can’t be shared freely online. "There are hundreds of thousands of books out there, and we don’t talk about them," Underwood said. "That is a dark landscape after the wall of copyright comes down. We can read the books one by one, but we can’t make generalizing claims at all." The HathiTrust Research Center is leading the Mellon-funded project to provide greater access to the digitized HathiTrust library.

To read further, please visit https://itnews.iu.edu/articles/2016/project-will-help-researchers-explore-big-data-in-hathitrust-digitized-library.php.