Preprints

Ensembled sparse-input hierarchical networks for high-dimensional datasets
Feng and Simon
[arXiv][code]

Selective prediction-set models with coverage guarantees
Feng, Sondhi, Perry and Simon
[arXiv]

Sparse-Input Neural Networks for High-dimensional Nonparametric Regression and Classification
Feng and Simon
[arXiv][code]

In press

Approval policies for modifications to Machine Learning-Based Software as a Medical Device: A study of bio-creep
Feng, Emerson and Simon
Biometrics, In press
[paper]

Estimation of cell lineage trees by maximum-likelihood phylogenetics
Feng, DeWitt, McKenna, Simon, Willis and Matsen
Annals of Applied Statistics, In press
[paper]

2020

Efficient nonparametric statistical inference on population feature importance using Shapley values
Williamson and Feng
International Conference on Machine Learning (ICML), 2020
[paper][code]

An analysis of the cost of hyper-parameter selection via split-sample validation, with applications to penalized regression
Feng and Simon
Statistica Sinica, 2020
[paper]

2019

Deep generative models for T cell receptor protein sequences
Davidsen, Olson, DeWitt, Feng, Harkins, Bradley and Matsen
Elife, 2019
[paper][code]

Survival analysis of DNA mutation motifs with penalized proportional hazards
Feng, Shaw, Minin, Simon and Matsen
Ann. Appl. Stat., 2019
[paper][code]

2018

Gradient-based Regularization Parameter Selection for Problems With Nonsmooth Penalty Functions
Feng and Simon
J. Comput. Graph. Stat., 2018
[paper][code]

Nonparametric variable importance using an augmented neural network with multi-task learning
Feng, Williamson, Simon and Carone
International Conference on Machine Learning (ICML), 2018
[paper][code]