Automatic Indexing of LaTex Documents

A couple weeks ago I mentioned in a post that I was working on a Python script to automatically generate indexes of books written in the LaTex typesetting system.  At the time I promised to post the script in “a couple of days”.  Predictably, weeks have passed, my little script has ballooned into a full on open-source software project, and the code is now too long to post (or explain) in a single blog article.  If you’re interested, however, you can now download my alpha release from sourceforge.

The package includes two Python programs.  Indexmeister is a console utility which reads a file (in several formats, not just LaTex) and suggests terms for indexing.  It uses three different methods to figure out which terms are important.  Imbrowse is a Curses program which helps you interactively browse multi-file LaTex books and quickly insert the right tags to generate an index.

I made this video tutorial to show how the system works:

In the future I am thinking of adding a plug-in for LibreOffice, and possibly a graphical interface (probably using GTK bindings). Porting it to Windoze is not a priority, however.

Advertisements

Posted on December 1, 2015, in Breaking News, Writing and tagged , , . Bookmark the permalink. Leave a comment.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

%d bloggers like this: