Skip to main content
SLU:s publikationsdatabas (SLUpub)

Sammanfattning

Understanding the architecture of translational regulatory sequences in diverse chloroplasts is critical for advancing synthetic biology and genetic engineering. In this study, a hybrid deep learning model combining convolutional neural network (CNN), long short-term memory (LSTM), Attention, and Residual architectures was developed to classify and analyse two datasets: 5′ untranslated region sequences from plants and algae, and the sequences with and without Shine-Dalgarno (SD) motifs from both groups. Using 300-nucleotide leader sequences upstream of the start codon as input, the model achieved strong prediction performance for both taxonomic origin and the presence or absence of SD motifs. However, a small subset of plant and algal sequences exhibited algal-like and plant-like patterns, respectively—an encouraging finding for identifying functional heterologous sequences from one group for use in the other group’s genome. The results further revealed significant differences in the plastid leader sequences between the datasets (Plants vs. Algae and SDs vs. without SDs), emphasising distinct features in the first 30 bp upstream of the start codon. This study proposes two potential strategies for introducing heterologous leader sequences in algal plastome engineering: (1) employing plant-derived leader sequences with algal-like patterns tailored to specific algal strains, and (2) constructing hybrid leader sequences harbouring SD motifs by fusing algae-specific ~30 bp upstream regions with their respective plant-derived distal regions. As the first deep learning model to analyse chloroplast translational regulatory sequences, the findings offer valuable guidance for identifying and predicting heterologous leader sequences in plants and algae.

Nyckelord

chloroplast; Convolutional Neural Network; Shine-Dalgarno motifs; translation; algae; CNN-LSTM; genetic engineering; leader sequence

Publicerad i

Frontiers in Plant Science
2025, volym: 16, artikelnummer: 1698951

SLU författare

UKÄ forskningsämne

Växtbioteknologi

Publikationens identifierare

  • DOI: https://doi.org/10.3389/fpls.2025.1698951

Permanent länk till denna sida (URI)

https://res.slu.se/id/publ/145582