CENdetectHOR is a novel computational tool designed for the identification and analysis of higher-order repeat (HOR) arrays in centromeric regions across diverse organisms. This tool is particularly significant due to the complex and variable nature of centromeric DNA, which is characterized by long arrays of repetitive satellite DNA that exhibit extensive variation in sequence, length, and organization across species.
CENdetectHOR has been validated using human and Arabidopsis thaliana genomes, demonstrating its efficacy in revealing the complexity and diversity of centromeric architectures. It allows researchers to:
The ability to pinpoint specific HOR variants and elucidate interindividual and interspecific variations in centromeric regions establishes CENdetectHOR as a powerful tool for advancing research on centromere structure, evolution, and function. This capability is essential for comparative studies across different species, contributing to a deeper understanding of centromere biology.
In summary, CENdetectHOR provides a comprehensive platform for exploring centromere variation across species, facilitating significant advancements in genomic research and our understanding of centromere dynamics.
import pandas as pd import plotly.express as px # Sample data for HORs identified in human chromosomes data = { 'Chromosome': ['1', '2', '3', '4', '5', '6', '7', '8', '9', '10', '11', '12', '13', '14', '15'], 'Monomer_Count': [30, 14, 15, 19, 6, 18, 6, 8, 11, 8, 12, 8, 7, 16, 11], 'HORs': [ ['C1H2', 'C1H12', 'C1H16'], ['C2H2', 'C2H4'], ['C3H15'], ['C4H19'], ['C5H4'], ['C6H18', 'C6H15'], ['C7H6'], ['C8H8'], ['C9H11'], ['C10H14'], ['C11H5', 'C11H12'], ['C12H8'], ['C13H7'], ['C14H16'], ['C15H11'] ] } # Create DataFrame df = pd.DataFrame(data) # Plotting the data fig = px.bar(df, x='Chromosome', y='Monomer_Count', title='Monomer Count in Human Chromosomes', labels={'Monomer_Count':'Number of Monomers', 'Chromosome':'Chromosome'}, color='Monomer_Count') fig.show()