(2014-09-16) This thesis explores various methods for analyzing data generated using the next-generation sequencing technology RNA-Seq. Two methods are developed which attempt to accurately calculate RNA expression, the first using a ...
(2013-08-22) This dissertation is centered on the modeling of heterogeneous data, which is ubiquitous in this digital information age. From the statistical point of view, heterogeneous data is composed of dissimilar components, where ...
(2010-08-31) The protein lysate array is an emerging technology for quantifying the protein concentration ratios in multiple biological samples. It is gaining popularity, and has the potential to answer questions about post-translational ...
(2015-04-21) Dyadic data are ubiquitous and arise in the fields of biology, epidemiology, sociology, and many more. Such dyadic data are often best understood within the framework of networks. Network data can vary in many ways. For ...
(2013-08-22) In many statistical applications today, we face models whose complexity increases with the sample size. Such models pose a challenge to traditional statistical analysis and call for new methodologies and new ...
Stopping Time of Invariant Sequential Probability Ratio Tests in Multivariate Analysis of Variance (1974)
(1995) The simultaneous and nonparametric estimation of latent abilities and item characteristic curves is considered. In particular, the joint asymptotic properties of ordinal ability estimation and kernel smoothed nonparametric ...
(2003) Imprecisely or incompletely observed data often appear in engineering, epidemiology and economic studies: observations on certain variables may be grouped or measured with errors, which poses challenges to the usual ...
(2014-09-16) This thesis covers two research areas: time-varying network estimation and Chinese word segmentation. Chapter 1 introduces the background of time-varying networks and the structure of the Chinese language, ...
(2005) The press releases from the Federal Open Market Committee (FOMC) are one of the major inputs to the interest rate futures market. To estimate the impact associated with the FOMC announcements, the random intervention model ...
(1991) Modern scientific researchers often use complex computer simulation codes for theoretical investigations. We model the response of a computer simulation code as the realization of a stochastic process. This approach, design ...
(2000) In the last part of the thesis we discuss the methods for handling censored data for both categorical and continuous responses. We develop EM algorithms for censored data in general, and also develop a weighted least ...
(1989) A popular nonparametric measure of a monotone relation between two variables is Kendall's tau. Originally, most analysis of this statistic assumed the two variables were independent, while more recently there has been ...
(1993) Several nonparametric tests based on ranks have been proposed to handle analysis of variance problems. But questions have been raised about the appropriateness of some procedures that are mere rank transformations of normal ...
(2014-05-30) Many variable selection methods are available for linear regression, but very little has been developed for quantile regression, especially for censored problems. This study will look at the possibilities of utilizing ...
(2012-05-22) In the marketing research world today, companies have access to massive amounts of data regarding the purchase behavior of consumers. Researchers study this data to understand how outside factors, such as demographics and ...
(1996) We first consider convergence in law of measurable processes with a general parameter set and a state space. To this end, we need to investigate topological properties of the space of measurable functions which is the paths ...
(2015-07-16) Weak signal identification and inference are very important in the area of penalized model selection, yet they are underdeveloped and not well studied. Existing inference procedures for penalized estimators are mainly ...
