ResearchMy research interests lie in the intersection of machine learning and optimization, as well as their applications on data mining and biomedical data science. In particular, my research focuses on answering the following three questions:
Modeling: Machine/Deep Learning on GraphsTo have a well-performing machine learning model, a critical step is to capture the intrinsic structure of the data, such as the local correlation of pixels in the image data, the dependence between different words in the language data. Different from these regular data, graph data encode more complicated relational information between different instances. Regular machine learning models fail to deal with the intrinsic relational information so that they cannot be applied to graph data directly. My research focus is to design new machine/deep learning models to effectively explore the relational information for graph data.
Optimization: Large-Scale Optimization/Training MethodsAfter having a machine learning model, a critical step is to optimize this model to get optimal model parameters. Considering the scalability of models and datasets, my research is focusing on designing efficient stochastic optimization algorithms to train large-scale non-convex machine learning models. In addition, deep neural networks are highly non-linear. They are easy to overfit and difficult to train. Besides optimization methods, I also work on designing efficient training methods for deep neural networks to avoid the overfitting issue and stabilize the training procedure.
Application: Biomedical Data Science & Online AdvertisingBesides the methodology side, I am also interested in applying machine learning to other fields, such as bioinformatics, online advertising. To design effective machine learning models for practical applications, it is important to incorporate the domain-specific knowledge. To bridge the gap between general machine learning models and the domain-specific application, my research work is to design new machine/deep learning models to fully exploit domain knowledge for better prediction.
|