BindLine

Introduction

BindLine is an interactive web tool for exploring protein–DNA interactions using Protein Binding Microarray (PBM) data. PBM experiments measure the relative binding strength of all possible DNA k-mers (sequences of length "k"), allowing binding preferences to be examined along full DNA sequences rather than summarized as a single consensus motif.
BindLine visualizes these quantitative scores directly on any DNA sequence, enabling identification of binding sites even when binding is weak, degenerate, or distributed across multiple sequence contexts.
BindLine is an interactive tool, allowing users smooth exploration of the data through zooming, panning, and toggling visibility of sequences and proteins.

Basic usage

Protein and DNA input

Load Sample

BindLine operates on protein PBM results and DNA sequences.
You may choose from public PBM datasets or upload your own. As a logged-in user, you may upload files for continuous use. Some of the public results, and possibly yours, have only part of the score types (see below). Protein may be associated with to one or more families (PFAMs), which are used for grouping in plots.
DNA sequences can be loaded from FASTA files or entered directly in the sequence table. All sequences in a single analysis must be aligned. The first sequence is treated as the reference sequence, while the others are interpreted as substitutions, insertions, or deletions relative to it. The “Load Sequences” button allows reviewing or preparing sequences before plotting.

Scoring

Load Sample

BindLine supports three types of PBM scores to evaluate protein–DNA binding:

I-score – Raw median fluorescence intensity.
Z-score – A measure of binding deviation from the median level observed across all k-mers.
E-score – Rank-based normalized enrichment score (−0.5 to 0.5).

Score types that are not included in from the uploaded PBM results are unsupported. You can choose the desired type before analysis. The "BindLine" plot displays the selected score along the aligned sequences. Each point corresponds to the score of the k-mer starting at that nucleotide position.

Threshold settings

Load Sample

BindLine offers the application of thresholds for each score type, enabling identification of significant binding events. You may enable and adjust them with sliders or numeric input fields. Ranks threshold, which highlights only the top-scoring k-mers, is also available. When thresholds are enabled, the "Binding Sites" plot appears and visualizes overlapping, competing, or nearby binding events.

Advanced usage

Search for binding sites across all proteins

Load Sample

Scan all available protein datasets and display only those with at least one k-mer exceeding the selected threshold in the input sequences. This mode is useful when the relevant transcription factor is unknown. We recommend not using a low threshold, as the results will be noisy.

Search for significant point mutations

Load Sample

Analyze the effects of single-nucleotide substitutions, insertions, and deletions derived from a reference sequence. Only mutations that create binding sites based on the selected threshold are shown. Mutations that produce identical binding behavior are collapsed and shown only once. When using this option, the "All Mutants" plot appears. It summarizes the effect of each substitution mutation on binding scores.

Show differences from the reference sequence only

Load Sample

Restrict visualization to mutations that create or abolish binding sites. De-Novo binding sites are shown in color, while destroyed sites are shown in gray.

Plots

Continuous visualization of PBM scores along aligned DNA sequences. Hovering the mouse over points highlights the corresponding k-mer. Clicking a legend entry toggles visibility of that sequence. Double-clicking a legend entry isolates that sequence. If more than one sequence / protein are shown, additional legend entries appear to control groups of lines of the same sequence / protein. Maximal score for each protein is indicated by a dashed line. Zooming by click-and-drag or by control buttons changes the x-axis range in all plots simultaneously.

Binding Sites

Discrete visualization of binding events enabled by score thresholds. Each binding site is represented as a rectangle spanning the sequence from the first to last nucleotide of the k-mers above the selected threshold. There may be overlapping binding sites for the same protein. By default, if there are multiple protein families (PFAMs) that the examined proteins belong to, grouping of their binding sites is performed to reduce visual clutter. By clicking a binding site rectangle, the traces of its PFAM are grouped / ungrouped. A toggle button allows switching between these views for all binding sites.

All Mutants

Summary of the effects of individual point mutations on binding scores. Each bar represents a single-nucleotide substitution relative to the reference sequence. The value indicates the difference between the highest score in the k-mer range of the mutant and the reference. A toggle button allows switching between the absolute value (score) and the effect of the mutation (difference = Δscore). It is important to pay attention to the absolute score values, and not to the mutation effect only. For that, it is recommended to look only at sites of interest in this plot.

Data and personal dashboard

Public PBM score files and FASTA sequences can be downloaded from the Data page. Logged-in users can manage uploaded files through their personal dashboard, including downloading or deleting files at any time.

Help

Contents

Quick start

Introduction

Basic usage

Protein and DNA input

Scoring

Threshold settings

Advanced usage

Search for binding sites across all proteins

Search for significant point mutations

Show differences from the reference sequence only

Plots

BindLine

Binding Sites

All Mutants

Data and personal dashboard