Browse Research and Tech Reports - Computer Science by Author "Han, Jiawei"

  • Yan, Xifeng; Yang, Jiong; Wang, Wei; Han, Jiawei (2005-02)
    Sensor networks have been deployed in various environments, from battle field surveillance to weather monitoring. The amount of data generated by the sensors can be large. One way to analyze such large data set is to capture ...

    application/pdf

    application/pdfPDF (158kB)
  • Xin, Dong; Shao, Zheng; Han, Jiawei; Liu, Hongyan (2005-10)
    It is well recognized that data cubing often produces huge outputs. Two popular efforts devoted to this problem are (1) iceberg cube, where only significant cells are kept, and (2)closed cube, where a group of cells which ...

    application/pdf

    application/pdfPDF (209kB)
  • Cai, Deng; He, Xiaofei; Han, Jiawei (2007-08)
    Linear Discriminant Analysis (LDA) has been a popular method for extracting features which preserve class separability. The projection vectors are commonly obtained by maximizing the between class covariance and simultaneously ...

    application/pdf

    application/pdfPDF (197kB)
  • Cai, Deng; He, Xiaofei; Han, Jiawei (2008-02)
    Recently Non-negative Matrix Factorization (NMF) has received a lot of attentions in information retrieval, computer vision and pattern recognition. NMF aims to find two non-negative matrices whose product can well approximate ...

    application/pdf

    application/pdfPDF (109kB)
  • Ji, Ming; Sun, Yizhou; Danilevsky, Marina; Han, Jiawei (2010-04-30)
    A heterogeneous information network is a network composed of multiple types of objects and links. Recently, it has been recognized that strongly-typed heterogeneous information networks are prevalent in the real world. ...

    application/pdf

    application/pdfPDF (242kB)
  • Cai, Deng; He, Xiaofei; Han, Jiawei (2006-07)
    Recently the problem of dimensionality reduction has received a lot of interests in many fields of information processing, including data mining, information retrieval, and pattern recognition. We consider the case where ...

    application/pdf

    application/pdfPDF (819kB)
  • Cai, Deng; He, Xiaofei; Han, Jiawei (2006-04)
    Most of the existing learning algorithms take vectors as their input data. A function is then learned in such a vector space for classification, clustering, or dimensionality reduction. However, in some situations, there ...

    application/pdf

    application/pdfPDF (185kB)
  • Lu, Ying; Han, Jiawei (2005-07)
    Maintaining frequency counts for data streams has attracted much interest among the research community recently since it provides the base for many stream mining applications. Most existing work followed the same paradigm: ...

    application/pdf

    application/pdfPDF (676kB)
  • Cai, Deng; Shao, Zheng; He, Xiaofei; Yan, Xifeng; Han, Jiawei (2005-03)
    Social network analysis has attracted much attention in recent years. Community mining is one of the major directions in social network analysis. Most of the existing methods on community mining assume that there is only ...

    application/pdf

    application/pdfPDF (235kB)
  • Cai, Deng; Mei, Qiaozhu; He, Xiaofei; Han, Jiawei (2008-01)
    Topic modeling has been a key problem for document analysis. One of the canonical approaches for topic modeling is Probabilistic Latent Semantic Indexing, which maximizes the joint probability of documents and terms in the ...

    application/pdf

    application/pdfPDF (142kB)
  • Cai, Deng; He, Xiaofei; Han, Jiawei (2006-07)
    A novel approach to linear dimensionality reduction is introduced that is based on Locality Preserving Projections (LPP) with a discretized Laplacian smoothing term. The choice of penalty allows us to incorporate prior ...

    application/pdf

    application/pdfPDF (360kB)
  • Cai, Deng; He, Xiaofei; Han, Jiawei (2006-07)
    Graph-based approaches for semi-supervised learning have received increasing amount of interest in recent years. Despite their good performance, many pure graph based algorithms do not have explicit functions and can not ...

    application/pdf

    application/pdfPDF (200kB)
  • Cai, Deng; He, Xiaofei; Han, Jiawei (2007-05)
    Spectral methods have recently emerged as a powerful tool for dimensionality reduction and manifold learning. These methods use information contained in the eigenvectors of a data affinity (\ie, item-item similarity) matrix ...

    application/pdf

    application/pdfPDF (273kB)
  • Cai, Deng; He, Xiaofei; Han, Jiawei (2007-05)
    Linear Discriminant Analysis (LDA) has been a popular method for extracting features which preserve class separability. The projection functions of LDA are commonly obtained by maximizing the between class covariance and ...

    application/pdf

    application/pdfPDF (249kB)
  • Cai, Deng; He, Xiaofei; Han, Jiawei (2005-05)
    Linear dimensionality reduction techniques have been widely used in pattern recognition and computer vision, such as face recognition, image retrieval, etc. The typical methods include Principal Component Analysis (PCA) ...

    application/pdf

    application/pdfPDF (220kB)
  • Cai, Deng; He, Xiaofei; Wen, Ji-Rong; Han, Jiawei; Ma, Wei-Ying (2006-04)
    We consider the problem of text representation and categorization. Conventionally, a text document is represented by a vector in high dimensional space. Some learning algorithms are then applied in such a vector space for ...

    application/pdf

    application/pdfPDF (220kB)
  • Cai, Deng; He, Xiaofei; Han, Jiawei (2006-04)
    Vector Space Model (VSM) has been at the core of information retrieval for the past decades. VSM considers the documents as vectors in high dimensional space. In such a vector space, techniques like Latent Semantic Indexing ...

    application/pdf

    application/pdfPDF (234kB)
  • Lee, Jae-Gil; Han, Jiawei; Whang, Kyu-Young (2007-03)
    Existing trajectory clustering algorithms group similar trajectories as a whole, thus discovering common trajectories. Our key observation is that clustering trajectories as a whole could miss common sub-trajectories. ...

    application/pdf

    application/pdfPDF (885kB)
  • Khan, Mohammad Maifi Hasan; Le, Khac Hieu; Ahmadi, Hossein; Abdelzaher, Tarek F.; Han, Jiawei (2010)
    This article presents a tool for uncovering bugs due to interactive complexity in networked sensing applications. Such bugs are not localized to one component that is faulty, but rather result from complex and unexpected ...

    application/pdf

    application/pdfPDF (366kB)
  • Cai, Deng; He, Xiaofei; Han, Jiawei (2005-09)
    Previous work has demonstrated that the image variations of many objects (human faces in particular) under variable lighting can be effectively modelled by low dimensional linear spaces. The typical methods for learning a ...

    application/pdf

    application/pdfPDF (494kB)