Publishing On OpenWetWare - Lessons Learned 3 - WikiToLatex and Publishing on the arXiv
This is the fourth report of the ‘Publishing on OpenWetWare’ series. In brief, I am writing an article on OWW from start to finish: initial writing -> collecting comments -> publishing on arXiv.org -> presenting at a conference. For other articles, see one, two, and three. In this report, I’ll share my experiences in how easy it was to convert the wiki text into latex and publish the paper on the arXiv.
The Setup
The article, ‘Python - All A Scientist Needs’ can be seen here. Once the article was in its final form, I wanted to formalize its publication by submitting a copy of it to the arXiv. The arXiv takes PDF submissions, so all I really had to do was to print the OWW page to PDF and submit that. However, I wanted things to look a bit nicer so I chose to convert the wiki text into latex, and submit a pdf generated by latex. This way I could also use the latex hyperref package to put clickable links within the PDF.
Wiki -> Latex
The final latex source of the article can be found here. To convert the wiki text into latex, I had to perform the following steps:
- Create the base latex file.
- Input the wiki text: To do this, I juts selected edit on the wiki page and copied and pasted the whole article into the latex document. Initially, I commented out the code examples and figures to deal with them later.
- Replace the wiki markup with latex markup:
- Make single quotes into latex quotes (” -> `’)
- Make italic quotes into emph (”” -> \emph{})
- Make bold quotes into \bf (””” -> {\bf})
- Replace tags with \cite (regexp (.*?) -> \cite{$1})
- Replace tags with $ (regexp (.*?) -> \$$1\$)
- Replace # lists with enumerate environments
- Replace * lists with itemize environments
- Added references: Here I had to manually convert the biblio package references into \bibitem’s in the latex source. I used the hyperref package and the \href command to link to resources on the web.
- Added the code examples: I chose the listings package, although it was a bit confusing to customize. In fact, the inadequacy of the package necessitated a code restructuring.
- Added the figure.
Overall, this was a pretty painful process, even for someone with a lot of experience with wikis and latex. It was hard to keep the wiki version and the latex version’s synced together as well. Overall the final PDF does not look as good as the wiki page. In total, the latexing took about 6 hours of work.
Publishing on the arXiv
Publishing the paper on the arXiv was very straight-forward. I simply filled out a form with the article metadata (title, abstract, etc.), uploaded my latex and figure files, and it was posted on the arXiv within a couple of days. You can see the article here.
There was a minor issue with copyright licensing that should be mentioned. OWW uses a Creative Commons Attribution-ShareAlike license. The arXiv offers a couple of license choices (CC-Attribution and CC-NonCommercial-Attribution-ShareAlike), but neither of them matched OWW’s license exactly. In the end I chose CC-Attribution, because I did not want to have the NonCommercial issue. We should probably sort this matter out if we want to create a lot of arXiv papers from OWW.
Conclusions
In the end, the process was worth it because there is now a referenceable copy of the article on the arXiv. (Referenceable in the sense that journals will accept a reference to an arXiv paper.) It took a little while to convert the wiki text into latex, but a lot of that can be facilitated by an automatic conversion tool.
Please let me know what you think by leaving a comment on this post!
Posted: March 15th, 2008 under Publishing.
Comments: none
Write a comment