Preprints

Sparse-Input Neural Networks for High-dimensional Nonparametric Regression and Classification
Feng and Simon
[arXiv][code]

Sequential algorithmic modification with test data reuse
Feng, Pennello, Petrick, Sahiner, Pirracchio and Gossmann
[arXiv]

2022

Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees
Feng, Gossmann, Sahiner and Pirracchio
Journal of the American Medical Informatics Association, 2022
[paper][code]

Ensembled sparse-input hierarchical networks for high-dimensional datasets
Feng and Simon
Statistical Analysis and Data Mining, 2022
[paper][code]

Overcoming barriers in the design and implementation of clinical trials for Acute Kidney Injury: a report from the 2020 Kidney Disease Clinical Trialists meeting
Lazzareschi, Mehta, Dember, Bernholz, Turan, Sharma, Kheterpal, Parikh, Ali, Schulman, Ryan, Feng, Simon, Pirracchio, Rossignol and Legrand
Nephrology Dialysis Transplantation, 2022
[paper]

2021

Approval policies for modifications to Machine Learning-Based Software as a Medical Device: A study of bio-creep
Feng, Emerson and Simon
Biometrics, 2021
[paper][code]

Estimation of cell lineage trees by maximum-likelihood phylogenetics
Feng, DeWitt, McKenna, Simon, Willis and Matsen
Annals of Applied Statistics, 2021
[paper][code]

Learning to safely approve updates to machine learning algorithms
Feng
Proceedings of the Conference on Health, Inference, and Learning, 2021
[paper][code]

Selective prediction-set models with coverage guarantees
Feng, Sondhi, Perry and Simon
Biometrics, 2021
[paper][code]

2020

Efficient nonparametric statistical inference on population feature importance using Shapley values
Williamson and Feng
International Conference on Machine Learning (ICML), 2020
[paper][code]

An analysis of the cost of hyper-parameter selection via split-sample validation, with applications to penalized regression
Feng and Simon
Statistica Sinica, 2020
[paper]

2019

Deep generative models for T cell receptor protein sequences
Davidsen, Olson, DeWitt, Feng, Harkins, Bradley and Matsen
eLife, 2019
[paper][code]

Survival analysis of DNA mutation motifs with penalized proportional hazards
Feng, Shaw, Minin, Simon and Matsen
Annals of Applied Statistics, 2019
[paper][code]

2018

Gradient-based Regularization Parameter Selection for Problems With Nonsmooth Penalty Functions
Feng and Simon
Journal of Computational and Graphical Statistics, 2018
[paper][code]

Nonparametric variable importance using an augmented neural network with multi-task learning
Feng, Williamson, Simon and Carone
International Conference on Machine Learning (ICML), 2018
[paper][code]