Browsing by Author "Ding, X"
Now showing 1 - 2 of 2
Results Per Page
Sort Options
Item Restricted Evidence-ranked motif identification.(Genome Biol, 2010) Georgiev, S; Boyle, AP; Jayasurya, K; Ding, X; Mukherjee, Sayan; Ohler, UwecERMIT is a computationally efficient motif discovery tool based on analyzing genome-wide quantitative regulatory evidence. Instead of pre-selecting promising candidate sequences, it utilizes information across all sequence regions to search for high-scoring motifs. We apply cERMIT on a range of direct binding and overexpression datasets; it substantially outperforms state-of-the-art approaches on curated ChIP-chip datasets, and easily scales to current mammalian ChIP-seq experiments with data on thousands of non-coding regions.Item Open Access On the spectral property of kernel-based sensor fusion algorithms of high dimensional dataDing, X; Wu, HTIn this paper, we apply local laws of random matrices and free probability theory to study the spectral properties of two kernel-based sensor fusion algorithms, nonparametric canonical correlation analysis (NCCA) and alternating diffusion (AD), for two sequences of random vectors $\mathcal{X}:=\{\xb_i\}_{i=1}^n$ and $\mathcal{Y}:=\{\yb_i\}_{i=1}^n$ under the null hypothesis. The matrix of interest is a product of the kernel matrices associated with $\mathcal{X}$ and $\mathcal{Y}$, which may not be diagonalizable in general. We prove that in the regime where dimensions of both random vectors are comparable to the sample size, if NCCA and AD are conducted using a smooth kernel function, then the first few nontrivial eigenvalues will converge to real deterministic values provided $\mathcal{X}$ and $\mathcal{Y}$ are independent Gaussian random vectors. We propose an eigenvalue-ratio test based on the real parts of the eigenvalues of the product matrix to test if $\mathcal{X}$ and $\mathcal{Y}$ are independent and do not share common information. Simulation study verifies the usefulness of such statistic.