Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.12202/8962
Title: A practical introduction to data preprocessing
Authors: Feltenberger, Dave
Motechin, Jonathan
Keywords: machine learning model
HHD dataset
Issue Date: May-2023
Publisher: Yeshiva University
Citation: Motechin, J. (2023, May). A practical introduction to data preprocessing [Unpublished undergraduate honors thesis, Yeshiva University].
Series/Report no.: Jay and Jeanie Schottenstein Honors Program;May 2023
Abstract: When the input data of a machine learning model undergoes the proper preprocessing, it can drastically affect the results. In Ben Gurion University’s paper about the HHD dataset, they discuss how their CNN model achieved 72.57% accuracy. When the same dataset was used to train a CNN after using the preprocessing steps mentioned above, the model achieved 84% accuracy. While Ben Gurion University does not discuss the preprocessing used in their model, it is clear that proper preprocessing of the data will play a considerable role in the success of the model. (from Conclusion)
Description: Undergraduate honors thesis / Opt-Out
URI: https://hdl.handle.net/20.500.12202/8962
Appears in Collections:Jay and Jeanie Schottenstein Honors Student Theses

Files in This Item:
File Description SizeFormat 
Jonathan Motechin Thesis OptOut opt.pdf
  Restricted Access
286.94 kBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons