Contact
Mail : etienne.julien@uphf.fr
Linkedin : https://www.linkedin.com/in/etiennejulien/
As part of my academic work in 2024, I wrote an article in IEEE format exploring the concept of JPEG DNA : a compression norm that try to develop efficient image coding specifically adapted for synthetic DNA-based storage. This work provides an overview of methods related to coding on DNA and his use.
JPEG DNA is a standard for encoding images specifically for synthetic DNA data storage. It adapts traditional JPEG, using the Discrete Cosine Transform (DCT) to get image coefficients, but maps these coefficients to the four DNA nucleotides (A, T, C, G), often involving techniques like ternary Huffman and Goldman encoding. The key challenges are to achieve high compression to lower the expensive DNA synthesis cost and to follow strict biochemical rules for DNA stability and synthesis.
Why make a compression norm for stockage in DNA? Synthesized DNA molecules are considered a promising option for cold data, such as archives or rarely accessed media storage, due to their unique biochemical properties, including potential for storing massive amounts of data in a small space with remarkable stability and longevity. But now, the technology is only in the beginning. His major problem is the synthesizing and sequencing speed. Indeed, in 2022, it takes 40 seconds to write a single nuclear base with enzymatic synthesis.
In 2024, I explored this topic in an article written in IEEE format, where I presented an overview of DNA data storage and the JPEG DNA standard currently under development. My aim was to provide a clear and accessible explanation of the core principles behind this emerging field. You can press the button to read the full article