Walkthrough: Using ANTS Memory Profiler to track down a memory leak in a WinForms application
This walkthrough shows how to locate a memory leak using a sample application called QueryBee. QueryBee is a simple WinForms application for running queries against SQL Server databases. It has a database connection dialog and a query window to query the database. We know our application is leaking memory, because every time we open a query window and close it again, our memory usage increases.
When you open the profiler, you first see the Startup screen:
Here, there’s a list of your recent profiling sessions so you can re-run them easily. For this example, we’ll start a new session by clicking New profiling session.
The New profiling session screen is displayed:
All we need to do is point it at QueryBee, choose our performance counters, and click Start profiling.
The profiler starts up QueryBee and begins collecting performance counter data:
Taking and comparing memory snapshots is a key activity when looking for memory leaks, so our approach will be as follows:
Wait for QueryBee to open.
Take a first snapshot without using the application; this first snapshot will be used as a baseline.
Within QueryBee, perform the actions that we think cause the memory leak.
Take a second snapshot.
Examine the comparison that the profiler shows us after it has finished taking and analyzing the second snapshot.
So, QueryBee is open, sitting in our system tray.
At this point, we take a first snapshot, which we will use as a baseline for comparison with later snapshots.
When we click the Take Memory Snapshot button, the memory profiler forces a full garbage collection and takes a snapshot of the heap memory it is using.
Now, we go back to QueryBee and perform the tasks which we think cause the memory leak.
We open up QueryBee and connect to a database.
The query window opens up and we enter and execute a SQL query.
We obtain some results and close the query window.
We close the query form.
At this point, the window is gone. We expect the memory usage to fall back to where it was in the first snapshot, but that is not the case.
So what's happening here? We take a second snapshot and get the results.
A number of problems are highlighted by the summary screen.
We can see a large memory increase between snapshots, which we noticed on the timeline (top left).
The Large Object Heap appears to be fragmented, which could cause problems (top right).
The Generation 2 heap accounts for a large proportion of memory usage - often indicating objects are being held onto for longer than necessary (bottom left).
We can choose to select one of the largest classes which are shown to us in the bottom right of the screen, but instead we switch to the class list to find out more. The class list gives us a fuller picture of what's in the snapshot.
We're interested in objects which have been created since the baseline snapshot, so we need to look at types which have more instances in the second snapshot. We therefore sort by Instance Diff in decreasing order.
The String class has been placed at the top of the list, with over 300,000 new instances. We want to understand why there is such a large increase so load the Instance Categorizer for the String class by clicking the icon.
We see that over 21MB of the String class are held in memory by the same shortest path back to GC Root, via our QueryForm and ConnectForm. We select Show the Instances on this Path to view a list of every instance in the previous category.
The Instance List is showing us data which QueryBee had retrieved from the SQL Database, but that data should have been destroyed when QueryForm was closed. We select one of the instances and click the icon to generate an Instance Retention Graph.
Using the instance retention graph, we should be able to find out what is still referencing our String instances. Then, we'll be able to go back into our code to break the chain of references that is keeping them in memory.
We start at the bottom and work our way up the graph until we find a reference that needs to be broken. We'll just need to break the chain at one point to allow the garbage collector to clean up everything below that.
By navigating up, we can see the string is being held onto by QueryForm, even though that should have been released from memory. Looking a little further up, the graph is telling us that a System.EventHandler is referencing QueryForm and, if we step up one more level, it's telling us that the event handler is referenced by our ConnectForm instance – this is the form that asked us for the database connection details. In other words, the ConnectForm is holding onto the QueryForm via an Event Handler.
If we look at this node more closely, we see that it's actually being referenced by the ConnectForm's Foregrounded field.
Let's find this Foregrounded event in our code. We right-click on the QueryBee.ConnectForm node and open the ConnectForm source code in Visual Studio™.
The profiler automatically jumps to the Foregrounded event. We check where it is being used by right-clicking on Find All References.
We've got three usages and we find that the last usage is where QueryForm registers for the Foregrounded event, but it doesn't look like it unregisters. If we fix that, then the memory leak should go away.
Once we're done, we need to rebuild, but first we need to stop profiling QueryBee so that the executable isn't locked. We go back to Profiler and click on the Stop Profiling button.
Then, we rebuild.
Back in the profiler, we start up a new profiling session. We want to find out whether the reference to the QueryForm has disappeared.
Note that it remembered our settings from last time, so all we need to do is click Start Profiling.
We take a first snapshot to use as a baseline.
We perform the same actions as last time: take a baseline snapshot while QueryBee is idle, then a snapshot once we’ve connected and run a query.
We'll also take an extra snapshot, because we want to be able to verify that the QueryForm has disappeared.
Finally, we close the query window with the results grid and we take a third snapshot.
We switch to a comparison between snapshots 1 and 3, using the snapshot selection field just under the timeline.
We can see there is now only a small memory increase between the snapshots, which is promising. Let's see if there's a QueryForm still in the class list.
We switch to the class list view and search only for classes in the QueryBee namespace.
No, it's gone. We're no longer leaking the form.
As you saw, it was fairly easy to track down a form which was being leaked.