Research reports

Towards a regularity theory for ReLU networks -- chain rule and global error estimates

by J. Berner and D. Elbrächter and P. Grohs and A. Jentzen

(Report number 2019-26)

Abstract
Although for neural networks with locally Lipschitz continuous activation functions the classical derivative exists almost everywhere, the standard chain rule is in general not applicable. We will consider a way of introducing a derivative for neural networks that admits a chain rule, which is both rigorous and easy to work with. In addition we will present a method of converting approximation results on bounded domains to global (pointwise) estimates. This can be used to extend known neural network approximation theory to include the study of regularity properties. Of particular interest is the application to neural networks with ReLU activation function, where it contributes to the understanding of the success of deep learning methods for high-dimensional partial differential equations.

Keywords:

BibTeX
@Techreport{BEGJ19_830,
  author = {J. Berner and D. Elbr\"achter and P. Grohs and A. Jentzen},
  title = {Towards a regularity theory for ReLU networks -- chain rule and global error estimates},
  institution = {Seminar for Applied Mathematics, ETH Z{\"u}rich},
  number = {2019-26},
  address = {Switzerland},
  url = {https://www.sam.math.ethz.ch/sam_reports/reports_final/reports2019/2019-26.pdf },
  year = {2019}
}

Disclaimer
© Copyright for documents on this server remains with the authors. Copies of these documents made by electronic or mechanical means including information storage and retrieval systems, may only be employed for personal use. The administrators respectfully request that authors inform them when any paper is published to avoid copyright infringement. Note that unauthorised copying of copyright material is illegal and may lead to prosecution. Neither the administrators nor the Seminar for Applied Mathematics (SAM) accept any liability in this respect. The most recent version of a SAM report may differ in formatting and style from published journal version. Do reference the published version if possible (see SAM Publications).

JavaScript has been disabled in your browser