Parametric flatten T-swish: An adaptive nonlinear activation function for deep learning
ReLU exhibits two shortcomings that hinder the training of deep neural networks: 1) the negative cancellation property of ReLU treats all negative inputs as unimportant information for learning, resulting in performance degradation; 2) the inherent predefined nature of ReLU is unlikely to promote additional flexibility, expressivity, and robustness to the networks.
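To make the two shortcomings concrete, the sketch below contrasts standard ReLU with a flatten T-swish-style activation carrying a shift parameter T. The exact PFTS formula is not given in this excerpt; the form used here (x·sigmoid(x) + T for x ≥ 0, and T otherwise, with T treated as learnable in the parametric variant) is an assumption for illustration.

```python
import numpy as np

def relu(x):
    # Standard ReLU: every negative input is cancelled to zero,
    # discarding whatever information it carried (shortcoming 1).
    return np.maximum(0.0, x)

def pfts(x, T=-0.2):
    # Hedged sketch of a flatten T-swish-style activation.
    # For x >= 0 it behaves like swish (x * sigmoid(x)) shifted by T;
    # for x < 0 it outputs T instead of zero, so negative inputs are
    # not fully cancelled. Making T a trainable parameter (rather than
    # a fixed constant) addresses shortcoming 2: the activation can
    # adapt its shape during training. T = -0.2 here is an assumed
    # default, not a value taken from the paper.
    sig = 1.0 / (1.0 + np.exp(-x))
    return np.where(x >= 0.0, x * sig + T, T)

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(relu(x))   # all negatives mapped to 0
print(pfts(x))   # negatives mapped to T instead of being zeroed out
```

In a deep-learning framework, T would be registered as a per-layer learnable parameter and updated by backpropagation alongside the weights.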