TY - GEN
T1 - Detecting problematic lookup functions in spreadsheets
AU - Hermans, Felienne
AU - Aivaloglou, Efthimia
AU - Jansen, Bas
PY - 2015/12/14
Y1 - 2015/12/14
N2 - Spreadsheets are used heavily in many business domains around the world. They are easy to use and as such enable end-user programmers to and build and maintain all sorts of reports and analyses. In addition to using spreadsheets for modeling and calculation, spreadsheets are often also used for creating reports and dashboards: combining data from different sources and creating overviews. For this, lookup functions can be used: they search for a value in a range and return a corresponding row or column. Lookup functions are common: according to recent research the VLOOKUP is the fifth most common Excel function. In this paper we investigate the use of lookup functions in more detail. We analyze lookup functions within the newly released Enron spreadsheet corpus. The results show that 1) a minority of 43% of lookup formulas use the default setting where an approximate match may be returned, 2) 77% of approximate matches are used unnecessary and 3) 23% of approximate lookups is problematic: they search over unsorted ranges, while this is specifically advised against in the specification, and might lead to wrong results.
AB - Spreadsheets are used heavily in many business domains around the world. They are easy to use and as such enable end-user programmers to and build and maintain all sorts of reports and analyses. In addition to using spreadsheets for modeling and calculation, spreadsheets are often also used for creating reports and dashboards: combining data from different sources and creating overviews. For this, lookup functions can be used: they search for a value in a range and return a corresponding row or column. Lookup functions are common: according to recent research the VLOOKUP is the fifth most common Excel function. In this paper we investigate the use of lookup functions in more detail. We analyze lookup functions within the newly released Enron spreadsheet corpus. The results show that 1) a minority of 43% of lookup formulas use the default setting where an approximate match may be returned, 2) 77% of approximate matches are used unnecessary and 3) 23% of approximate lookups is problematic: they search over unsorted ranges, while this is specifically advised against in the specification, and might lead to wrong results.
UR - http://www.scopus.com/inward/record.url?scp=84959925607&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84959925607&partnerID=8YFLogxK
U2 - 10.1109/VLHCC.2015.7357210
DO - 10.1109/VLHCC.2015.7357210
M3 - Conference contribution
AN - SCOPUS:84959925607
T3 - Proceedings of IEEE Symposium on Visual Languages and Human-Centric Computing, VL/HCC
SP - 153
EP - 157
BT - Proceedings - 2015 IEEE Symposium on Visual Languages and Human-Centric Computing, VL/HCC 2015
A2 - Fleming, Scott D.
A2 - Li, Zhen
A2 - Ermel, Claudia
PB - IEEE Computer Society
T2 - IEEE Symposium on Visual Languages and Human-Centric Computing, VL/HCC 2015
Y2 - 18 October 2015 through 22 October 2015
ER -