TabFuzz: High-level mutations for tabular data

None, None

TabFuzz: High-level mutations for tabular data

Bachelor Thesis (2021)

Author(s)

M.J.P. Smits (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Burcu Kulahcioglu Kulahcioglu Ozkan – Mentor (TU Delft - Software Engineering)

Faculty

Electrical Engineering, Mathematics and Computer Science

Copyright

Tabular data Fuzz testing Test generation Automated testing Big data

To reference this document use:

https://resolver.tudelft.nl/uuid:9cc774eb-c4fa-4055-9f18-76f49cf65e8a

More Info

expand_more

Publication Year

2021

Language

English

Copyright

Graduation Date

25-06-2021

Awarding Institution

Delft University of Technology

Programme

['Computer Science']

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Big Data is an expanding industry, yet exhaustive and automated testing of Big Data applications is still in its early stages. In the last few years, testing framework for Big Data applications have started appearing. BigFuzz is a program that uses fuzz testing for Big Data applications. Fuzz testing means generating random, potentially invalid or erroneous, inputs in attempt to find exceptions. This paper introduces TabFuzz, a tool that improves and extends the BigFuzz solution. TabFuzz reproduces the BigFuzz implementation and extends on it, by improving the generation of random input files. TabFuzz can generate a valid input file based on an input specification. It then mutates this file using high-level mutations. These mutations generate new test inputs that mimic real-world problems. This is an improvement over bit or byte level mutations. These mutations are supposed to mimic real-world problem, which is an improvement over random bit or byte level mutations. Most fuzzing programs start from a user-defined initial input file, called a seed file. TabFuzz offers the possibility to generate such a file. This research shows that these generated files are just as effective as starting from a seed file.

Files

Research_Project_FinalReport.p... (pdf)

(pdf | 0.384 Mb)

License info not available