Publications and Theses

The documents referenced below are included by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a non-commercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright.


  1. Sourav Chatterji, Ichitaro Yamazaki, Zhaojun Bai and Jonathan Eisen, CompostBin: A DNA composition-based algorithm for binning environmental shotgun reads, to appear in RECOMB 2008.
  2. Drosophila Comparative Genome Sequencing and Analysis Consortium, Evolution of genes and genomes in the context of the Drosophila phylogeny. Nature 2007, 450:203-218.
  3. Sourav Chatterji and Lior Pachter, Patterns of gene duplication and intron loss in the ENCODE regions suggest a confounding factor, Genomics 2007, 90(1):44-48.
  4. Sourav Chatterji and Lior Pachter, Reference based gene annotation with GeneMapper. Genome Biology 2006, 7(4):R29.
  5. Sourav Chatterji and Lior Pachter, Large multiple organism genefinding by collapsed Gibbs Sampling. Journal of Computational Biology 2005, 12(6):599-608.
  6. Chicken Genome Sequencing Consortium, Sequence and comparative analysis of chicken genome provide unique perspectives into vertebrate evolution. Nature 2004, 432(7018):695-716.
  7. Rat Genome Sequencing Consortium, Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature 2004, 428(6982):493-521.
  8. Adam M. Breier, Sourav Chatterji and Nicholas R. Cozzarelli, Prediction of Saccharomyces cerevisiae Replication Origins. Genome Biology 2004, 5(4):R22.
  9. Sourav Chatterji and Lior Pachter, Multiple Organism Gene Finding by Collapsed Gibbs Sampling. Proceedings of the Eighth Annual International Conference on Computational Molecular Biology 2004, 187-193.
  10. Sourav Chatterji, Manikandan Narayanan, Jason Duell and Leonid Oliker. Performance Evaluation of Two Emerging Media Processors: VIRAM and Imagine. (IPDPS 2003) [pdf]
  11. Sourav Chatterji, S.S.K Evani, Sumit Ganguly, Mahesh Datt Yemmanuru. On the Complexity (Non-Approximability) of Query Optimization. (PODS 2002) [postscript]
  12. Sourav Chatterji, Y. Mahesh Datt. On Finding Approximate Solutions to the Query Optimization Problem (B. Tech Thesis IIT Kanpur). [ps.gz]