dc.contributor.author | Kaytan, Mustafa | |
dc.contributor.author | Aydilek, İbrahim Berkan | |
dc.contributor.author | Yeroğlu, Celaleddin | |
dc.date.accessioned | 2024-01-15T12:11:38Z | |
dc.date.available | 2024-01-15T12:11:38Z | |
dc.date.issued | 2023 | en_US |
dc.identifier.citation | Kaytan, M., Aydilek, İ.B., Yeroğlu, C. (2023). Gish: a novel activation function for image classification. Neural Computing and Applications, 35 (34), pp. 24259-24281.
https://doi.org/10.1007/s00521-023-09035-5 | en_US |
dc.identifier.issn | 0941-0643 | |
dc.identifier.issn | 1433-3058 | |
dc.identifier.uri | https://doi.org/10.1007/s00521-023-09035-5 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12508/3005 | |
dc.description.abstract | In Convolutional Neural Networks (CNNs), the selection and use of appropriate activation functions is of critical importance. It has been seen that the Rectified Linear Unit (ReLU) is widely used in many CNN models. Looking at the recent studies, it has been seen that some non-monotonic activation functions are gradually moving towards becoming the new standard to improve the performance of CNN models. It has been observed that some non-monotonic activation functions such as Swish, Mish, Logish and Smish are used to obtain successful results in various deep learning models. However, only a few of them have been widely used in most of the studies. Inspired by them, in this study, a new activation function named Gish, whose mathematical model can be represented by y=x·ln(2-e-ex) , which can overcome other activation functions with its good properties, is proposed. The variable x is used to contribute to a strong regulation effect of negative output. The logarithm operation is done to reduce the numerical range of the expression (2-e-ex) . To present our contributions in this work, various experiments were conducted on different network models and datasets to evaluate the performance of Gish. With the experimental results, 98.7% success was achieved with the EfficientNetB4 model in the MNIST dataset, 86.5% with the EfficientNetB5 model in the CIFAR-10 dataset and 90.8% with the EfficientNetB6 model in the SVHN dataset. The obtained performances were shown to be higher than Swish, Mish, Logish and Smish. These results confirm the effectiveness and performance of Gish. | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Springer | en_US |
dc.relation.isversionof | 10.1007/s00521-023-09035-5 | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Convolutional neural network | en_US |
dc.subject | Gish | en_US |
dc.subject | Image classification | en_US |
dc.subject | Nonmonotonic activation function | en_US |
dc.subject.classification | Object Detection | |
dc.subject.classification | Deep Learning | |
dc.subject.classification | IOU | |
dc.subject.classification | Electrical Engineering, Electronics & Computer Science
- Computer Vision & Graphics
- Genome Rearrangement | |
dc.subject.other | Convergence | |
dc.subject.other | Networks | |
dc.subject.other | Neurons | |
dc.subject.other | Speed | |
dc.subject.other | Chemical activation | |
dc.subject.other | Convolution | |
dc.subject.other | Convolutional neural networks | |
dc.subject.other | Deep learning | |
dc.subject.other | Neural network models | |
dc.subject.other | Activation functions | |
dc.subject.other | Convolutional neural network | |
dc.subject.other | Gish | |
dc.subject.other | Images classification | |
dc.subject.other | Linear units | |
dc.subject.other | Monotonics | |
dc.subject.other | Neural network model | |
dc.subject.other | Nonmonotonic | |
dc.subject.other | Nonmonotonic activation function | |
dc.subject.other | Performance | |
dc.subject.other | Image classification | |
dc.title | Gish: a novel activation function for image classification | en_US |
dc.type | article | en_US |
dc.relation.journal | Neural Computing and Applications | en_US |
dc.contributor.department | Mühendislik ve Doğa Bilimleri Fakültesi -- Bilgisayar Mühendisliği Bölümü | en_US |
dc.identifier.volume | 35 | en_US |
dc.identifier.issue | 34 | en_US |
dc.identifier.startpage | 24259 | en_US |
dc.identifier.endpage | 24281 | en_US |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
dc.contributor.isteauthor | Yeroğlu, Celaleddin | |
dc.relation.index | Web of Science - Scopus | en_US |
dc.relation.index | Web of Science Core Collection - Science Citation Index Expanded | |