This is the artifact for the paper "Causal Testing: Understanding Defects' Root Causes". This artifact facilitates the use of Holmes for debugging defects and re-use of the implementation and/or artifacts in other research studies.
Our artifact includes the following:
-
Holmes, our prototype Causal Testing implementation.
-
A virtual machine that facilitates the use of Holmes, our Causal Testing prototype.
-
The supplemental data collection materials used when evaluating Holmes.
All the above listed artifacts, with the exception of the virtual machine file, are located in this repository.
Copied from INSTALL.md
- Download VirtualBox.
- Download virtual machine file CausalTesting_Artifact.ova.
Please note this is a large file (5BG) and may some time to download. - Open VirtualBox.
- Go to File > Import Appliance...
- Find and select the downloaded virtual machine file (CausalTesting_Artifact.ova). Click "Continue".
- Leave all the settings as they are and click "Import".
Once the virtual machine is imported, it will appear in your VirtualBox Manager as CausalTesting_Artifact.
You can now start the virtual machine by clicking the green "Start" arrow at the top of the VirtualBox Manager (see screenshot below).
If the virtual machine loads and you find it is not an appropriate size, you can find the size that works for you by doing the following:
- In your VirtualBox VM menu, go to View > Virtual Screen 1 > .... You will see different scaling options; select the one that best suites your screen.
- Your menu may look different if you are running a different operating system. However, if given percentages to re-scale they will have the same effect.
If you leave the virtual machine and return to a black screen or screen saver, press any key on your keyboard to wake up the virtual machine.
Once the virtual machine loads, Eclipse will open. In Eclipse, there will be 8 projects loaded on the left side in the Project Explorer. Seven of the projects are from the Defects4J defect benchmark; these projects are labeled Defect_0_Training, Defect_1, Defect_2, Defect_3, Defect_4, Defect_5, and Defect_6. The eighth project is Holmes.
At the bottom of the window, the Tasks View is open with a list of TODOs. Each TODO maps to a failing test in its respective project that exposes a defect in that project's source code. For example TODO: Test 00 (Training) maps to a failing test in the project Defect_0_Training that exposes a defect.
You may get a "Low Disk Space" warning -- you can click "Ignore". Running Holmes doesn't require a significant amount of memory.
The runtime for Holmes varies depending on a number of factors, such as input value, input difference threshold, and project size. The current version of Holmes works on tests for single parameter methods that take either a primitive type or String. To replicate Holmes' runtime when generating tests, do the following:
- Double-click the TODO labeled TODO: Test 00 (Training). Within a few seconds, the file BooleanUtilsTest.java opens at the
test_toBoolean_String()
method. Inside thetest_toBoolean_String()
method is the failing testassertEquals(false, BooleanUtils.toBoolean("tru");
.
- Double-click the method call
toBoolean
so that it is highlighted, as show in the screenshot below.
- Right-click the highlighted method and click "Run Holmes" in the pop-up menu (shown below). The editor will automatically go to the top of the file and some dialog windows may pop up as Holmes generates and executes tests. This process will take a minute or two; to reduce the chances of Holmes or the virtual machine hanging, we selected a defect for which Holmes is quickly able to find similar passing tests.
Eventually, a view labeled "Holmes View" will open at the bottom of the screen with the results of the execution (as shown below).
Now let's see how we can use Causal Testing to debug.
Holmes is a prototype implementation of our novel testing technique, Causal Testing. Causal Testing conducts causal experiments, which involves perturbing test inputs to find passing executions that are similar to the failing execution, to help developers understand why a given test is failing and how to fix it.
To see how you can use Causal Testing to debug a failing test, we first need to produce the output:
- Double-click the TODO labled TODO: Test 01. This will open the file StringEscapeUtilsTest.java at the
testEscapeJavaWithSlash()
test method. Inside the test method is the following failing test:
String input = "String with a slash (/) in it";
final String expected = input;
final String actual = StringEscapeUtils.escapeJava(input);
assertEquals(expected, actual);
- For this test, the method call being tested is
escapeJava(input)
. Therefore, to invoke Holmes we want to double-click to highlightescapeJava
, as shown below.
- Right click the highlighted method and select Run Holmes from the pop-up menu. The output will appear at the bottom of the screen in the "Holmes View"
Now that we have the Causal Testing results, we can begin to debug the defect.
First, we can see that Holmes has provided three similar passing tests and three similar failing tests. Just from looking at the inputs to the tests that pass and the tests that fail we can see that, like the original failing test, all the additional failing tests include the /
character while the tests that pass do not. This suggests the defect has something to do with the presence of the /
in the input string.
Second, we can see that each generated test has a button under it labeled "See Execution Trace". Clicking this button opens a minimized trace of the execution; clicking the button again hides the trace.
Let's look at the trace for the most similar passing input, "String with a slash in it"
:
We can see that as expected, the final method call to escapeJavaStyleString
returns the same string that was input, as the test expected (assertEquals(expected,actual)
). When we look at the trace for the original failing test, we can see that this same method call adds an additional character to the input string (String with a slash (\/) in it
), causing the test to fail.
Based on this information, we can hypothesize that the cause of this defect is the mishandling of the /
character in the escapeJavaStyleString
method. Now let's see if we can find where this happening and how we can fix it.
To navigate to this final method call escapeJavaStyleString
in Eclipse, do the following:
- Press and hold the control button on your keyboard and hover over the
escapeJava
method call in the test method. The method should become a link and open a pop-up menu, as shown in the screenshot below.
-
Click "Open Declaration" in the pop-up menu. This will take you to the
escapeJava
method implementation in theStringEscapeutils
class. -
Still following the trace provided by Holmes, we repeat Step 1 on the
escapeJavaStyleString
method call inside theescapeJava
method. -
The trace indicates that is one more method call to another method named
escapeJavaStyleString
inside theescapeJavaStyleString
method. We repeat Step 1 one last time on this last call toescapeJavaStyleString
and find ourselves in the final method of the execution.
Knowing that the part of the input causing the problem is the /
character, we can begin by skimming the method for where this character is (or isn't) handled. Eventually we get to line 243, where we see the case for the /
character and we can see where the extra character is coming from:
case '/':
out.write("\\");
out.write("/");
Given this code exists, we can assume that just erasing this case may fix this defect but cause problems in other tests or parts of the code. Therefore, we need a fix that makes it so this case does not execute when called from escapeJava
.