Analysis of cross sequence similarities for multiple DNA sequences compression

Paula Wu, Ngai Fong Law, Wan Chi Siu

Research output: Journal article publicationJournal articleAcademic researchpeer-review

7 Citations (Scopus)


Current DNA compression algorithms rely on finding repetitions within the DNA sequence so that similar subsequences can be encoded by referencing to each other. We explore similarities between different chromosomes of the sequence ‘Saccharomyces cerevisiae’. These similarities are characterised by the existence of similar subsequences among different chromosomes. The longer the similar subsequences are, the higher the cross-similarities are. Our study indicates that these cross-sequence similarities are often significant as compared to self-sequence similarity. This implies that it would be advantageous to compress two or more chromosome sequences together so that similar subsequences found between multiple chromosome sequences can be encoded together.
Original languageEnglish
Pages (from-to)437-454
Number of pages18
JournalInternational Journal of Computer Aided Engineering and Technology
Issue number4
Publication statusPublished - 1 Jan 2009


  • chromosome
  • compression
  • computer aided engineering
  • cross chromosomal similarities
  • deoxyribonucleic acid sequence
  • DNA sequence
  • multiple chromosome
  • multiple DNA sequences
  • prediction
  • Saccharomyces cerevisiae
  • technology

ASJC Scopus subject areas

  • Software
  • Engineering(all)
  • Computer Science Applications


Dive into the research topics of 'Analysis of cross sequence similarities for multiple DNA sequences compression'. Together they form a unique fingerprint.

Cite this