
浏览量:3395
Chi Wang
Researcher
Microsoft Corporation
Login to view more

My research is centered around data science, an interplay between theories and systems.
论文共 63 篇
NetSMF: Large-Scale Network Embedding as Sparse Matrix Factorization
Selectivity estimation for range predicates using lightweight models
Empirical Entropy Approximation via Subsampling: Theory and Application
Efficient Attribute Recommendation with Probabilistic Guarantee.
ABC: Efficient Selection of Machine Learning Configuration on Large Dataset
Trust, but Verify: Optimistic Visualizations of Approximate Queries for Exploring Big Data.
Accounting for the Correspondence in Commented Data
Identifying Outlier Arms in Multi-Armed Bandit.
Identifying Semantically Deviating Outlier Documents.
Automatic Entity Recognition and Typing in Massive Text Corpora.
Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee.
Scalable topical phrase mining from text corpora
A privacy mechanism for mobile-based urban traffic monitoring
Mining Latent Entity Structures
ClusType: Effective Entity Recognition and Typing by Relation Phrase-Based Clustering
Towards Interactive Construction of Topical Hierarchy: A Recursive Tensor Decomposition Approach
Concept Expansion Using Web Tables
GIN: A Clustering Model for Capturing Dual Heterogeneity in Networked Data.
Mining Quality Phrases from Massive Text Corpora
Bringing structure to text: mining phrases, entities, topics, and hierarchies
Automatic Construction and Ranking of Topical Keyphrases on Collections of Short Documents.
Scalable Moment-Based Inference for Latent Dirichlet Allocation.
NewsNetExplorer: automatic construction and exploration of news information networks
Mining latent entity structures from massive unstructured and interconnected data
Constructing Topical Hierarchies in Heterogeneous Networks
Large-scale spectral clustering on graphs
Constructing Topical Hierarchies in Heterogeneous Information Networks
Semantic Frame-Based Document Representation for Comparable Corpora.
Multi-View Clustering via Joint Nonnegative Matrix Factorization.
On the Detectability of Node Grouping in Networks.
AMETHYST: a system for mining and exploring topical hierarchies of heterogeneous data
Content coverage maximization on word networks for hierarchical topic summarization
Ranking-based name matching for author disambiguation in bibliographic data
Targeted disambiguation of ad-hoc, homogeneous sets of named entities
Scalable influence maximization for independent cascade model in large-scale social networks.
Learning online discussion structures by conditional random fields
Learning relevance from heterogeneous social network and its application in online targeting
LikeMiner: a system for mining the power of 'like' in social media networks
Dynamic Social Influence Analysis through Time-Dependent Factor Graphs
WINACS: construction and analysis of web-based computer science information networks
Mining advisor-advisee relationships from research publication networks
On community outliers and their efficient detection in information networks
Scalable influence maximization for prevalent viral marketing in large-scale social networks
Decomposition: Privacy Preservation for Multiple Sensitive Attributes
Social influence analysis in large-scale networks
BSGI: An Effective Algorithm towards Stronger l-Diversity