{"id":89427,"date":"2021-01-05T00:13:04","date_gmt":"2021-01-05T00:13:04","guid":{"rendered":"https:\/\/www.red-gate.com\/simple-talk\/?p=89427"},"modified":"2026-04-14T13:12:46","modified_gmt":"2026-04-14T13:12:46","slug":"heaps-in-sql-server-part-4-pfs-contention","status":"publish","type":"post","link":"https:\/\/www.red-gate.com\/simple-talk\/databases\/sql-server\/t-sql-programming-sql-server\/heaps-in-sql-server-part-4-pfs-contention\/","title":{"rendered":"SQL Server Heap PFS Contention: Diagnosing INSERT Bottlenecks (Part 4)"},"content":{"rendered":"<p>PFS (Page Free Space) contention is a bottleneck that occurs when many concurrent sessions insert data into a SQL Server heap at the same time. SQL Server maintains PFS pages to track free space on data pages; when a new row is inserted into a heap that has no space on an existing page, a new page must be allocated and the PFS page must be updated. Under high-concurrency INSERT workloads, this PFS update becomes a serialisation point &#8211; sessions queue to update the same PFS page, causing latch waits that are visible as PAGELATCH_UP or PAGELATCH_EX waits in sys.dm_os_wait_stats. This article demonstrates the bottleneck with a concrete INSERT test, explains the data structure mechanics, and shows how row sizing and space management affect contention frequency.<\/p>\n<p><strong>The series so far:<\/strong><\/p>\n<ol>\n<li><a href=\"https:\/\/www.red-gate.com\/simple-talk\/sql\/sql-development\/heaps-in-sql-server-part-1-the-basics\/\">Heaps in SQL Server: Part 1 The Basics<\/a><\/li>\n<li><a href=\"https:\/\/www.red-gate.com\/simple-talk\/sql\/sql-development\/heaps-in-sql-server-part-2-optimizing-reads\/\">Heaps in SQL Server: Part 2 Optimizing Reads<\/a><\/li>\n<li><a href=\"https:\/\/www.red-gate.com\/simple-talk\/sql\/sql-development\/heaps-in-sql-server-part-3-nonclustered-indexes\/\">Heaps in SQL Server: Part 3 Nonclustered Indexes<\/a><\/li>\n<li>\u00a0<a href=\"https:\/\/www.red-gate.com\/simple-talk\/sql\/sql-development\/heaps-in-sql-server-part-4-pfs-contention\/\">Heaps in SQL Server: Part 4 PFS contention<\/a><\/li>\n<\/ol>\n\n<p>After looking at the internal structures and the selection of data in heaps in the previous articles, the next articles will describe how DML operations can be optimized on a heap.<\/p>\n<h2>Demo set up<\/h2>\n<p>I use data from a demo database for all demos for demonstration purposes in articles and conferences. You can download the database [CustomerOrders] <a href=\"https:\/\/bit.ly\/CustomerOrdersDB\">here<\/a>.<\/p>\n<p>In this article, I\u2019ll use an additional database, demo_db. Run the following script to create the demo_db database, a heap to test inserts, and a view pointing to [CustomerOrders].<\/p>\n<pre class=\"lang:tsql theme:ssms2012-simple-talk\">CREATE DATABASE demo_db;\nGO\nUSE demo_db;\nGO\nCREATE TABLE dbo.Customers\n(\n\tId     INT          NOT NULL,\n\tName   VARCHAR(200) NOT NULL,\n\tCCode  CHAR(3)      NOT NULL,\n\tState  VARCHAR(200) NOT NULL,\n\tZIP    CHAR(10)     NOT NULL,\n\tCity   VARCHAR(200) NOT NULL,\n\tStreet VARCHAR(200) NOT NULL\n);\nGO\nCREATE VIEW dbo.CustomerAddresses\nAS\n\tSELECT\tC.Id,\n\t\tC.Name,\n\t\tA.CCode,\n\t\tA.State,\n\t\tA.ZIP,\n\t\tA.City,\n\t\tA.Street\n\tFROM\tCustomerOrders.dbo.Customers AS C\n\t\tINNER JOIN CustomerOrders.dbo.CustomerAddresses AS CA\n\t\tON (C.Id = CA.Customer_Id)\n\t\tINNER JOIN CustomerOrders.dbo.Addresses AS A\n\t\tON (CA.Address_Id = A.Id)\n\tWHERE\tCA.IsDefault = 1;\nGO<\/pre>\n<h2>Standard Procedure &#8211; INSERT<\/h2>\n<p>When data records are entered in a heap, this process consists of several individual steps that are transparent to the applications. Knowing them leaves room for possible optimization of the process.<\/p>\n<h3>Update of PFS<\/h3>\n<p>If a data row is stored in a heap and there is not enough space available on the data page, a new data page must be created. The data record can only be saved after the new page has been created.<\/p>\n<p>In the first demo, insert one row into the formerly created table from the created view.<\/p>\n<pre class=\"lang:tsql theme:ssms2012-simple-talk\">CHECKPOINT;\nGO\nINSERT INTO dbo.Customers\nSELECT\t*\nFROM\tdbo.CustomerAddresses\nWHERE\tId = 1;\nGO<\/pre>\n<p>The above example adds a new record from an existing data source to the new table. Since the table was previously empty, the table structure must first be created. The undocumented function sys.fn_dblog () can be used to determine which tasks Microsoft SQL Server had to perform to insert the record into the table. I used CHECKPOINT to eliminate previous operations from appearing in the results below.<\/p>\n<pre class=\"lang:tsql theme:ssms2012-simple-talk\">SELECT\tROW_NUMBER() OVER (ORDER BY [Current LSN])\t[Step #],\n\t[Current LSN],\n\tOperation,\n\tContext,\n\tAllocUnitName,\n\t[Page ID],\n\t[Slot ID]\nFROM\tsys.fn_dblog(NULL, NULL)\nWHERE\tCONTEXT &lt;&gt; N'LCX_NULL'\n\tAND AllocUnitName IS NOT NULL;\nGO<\/pre>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"301\" class=\"wp-image-89428\" src=\"https:\/\/www.red-gate.com\/simple-talk\/wp-content\/uploads\/2021\/01\/word-image-23.png\" \/><\/p>\n<p><strong>Figure 1: Recording from the Transaction Log<\/strong><\/p>\n<table>\n<thead>\n<tr>\n<th>\n<p>Step(s)<\/p>\n<\/th>\n<th>\n<p>Operation and Context<\/p>\n<\/th>\n<th>\n<p>Description<\/p>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<p>1 and 2<\/p>\n<\/td>\n<td>\n<p>LOP_MODIFY_ROW \/ LCK_PFS<\/p>\n<\/td>\n<td>\n<p>Since data pages are first created for the table, each assignment must be \u201cregistered\u201d in the PFS page. A data page and the IAM page are created and registered for the table.<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>3<\/p>\n<\/td>\n<td>\n<p>LOP_FORMAT_PAGE \/ LCX_IAM<\/p>\n<\/td>\n<td>\n<p>Creation of the IAM page for table dbo.Customers<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>4<\/p>\n<\/td>\n<td>\n<p>LOP_MODIFY_ROW \/ LCX_IAM<\/p>\n<\/td>\n<td>\n<p>Registration of the first data page in IAM page<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>5 and 6<\/p>\n<\/td>\n<td>\n<p>LOP_ROOT_CHANGE \/ LCX_CLUSTERED<\/p>\n<\/td>\n<td>\n<p>Registration of table metadata in Microsoft SQL Server system tables<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>7<\/p>\n<\/td>\n<td>\n<p>LOP_FORMAT_PAGE \/ LCX_HEAP<\/p>\n<\/td>\n<td>\n<p>Preparation of the data page for the heap for storing the records.<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>8<\/p>\n<\/td>\n<td>\n<p>LOP_ROOT_CHANGE \/ LCX_CLUSTERED<\/p>\n<\/td>\n<td>\n<p>Storage of metadata in Microsoft SQL Server system tables<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>9<\/p>\n<\/td>\n<td>\n<p>LOP_INSERT_ROWS \/ LCX_HEAP<\/p>\n<\/td>\n<td>\n<p>Insert row in Heap<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>10<\/p>\n<\/td>\n<td>\n<p>LOP_SET_FREE_SPACE \/ LCX_PFS<\/p>\n<\/td>\n<td>\n<p>Update of the filling level of the data page for the PFS page<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Note:\u00a0 I describe the system pages and their functions in detail in the article \u201c<a href=\"https:\/\/www.red-gate.com\/simple-talk\/sql\/sql-development\/heaps-in-sql-server-part-1-the-basics\/\">Heaps \u2013 The Basics<\/a>\u201d.<\/p>\n<p>If further records are entered, the existing data page is filled until it is &#8211; in percentage terms &#8211; so full that no new records can be saved on it and Microsoft SQL Server has to allocate the next data page in the system.<\/p>\n<p>Run this script to add another 10,000 rows.<\/p>\n<pre class=\"lang:tsql theme:ssms2012-simple-talk\">CHECKPOINT;\nGO\nDECLARE @I INT = 2\nWHILE @I &lt;= 10000\nBEGIN\n\tINSERT INTO dbo.Customers\n\tSELECT * FROM dbo.CustomerAddresses\n\tWHERE   Id = @I;\n \n\tSET @I += 1;\nEND\nGO<\/pre>\n<p>Another 10,000 records will be inserted into the table [dbo].[Customers] with the code above. Afterwards, look into the Transaction log to see the single transactional steps.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"527\" class=\"wp-image-89429\" src=\"https:\/\/www.red-gate.com\/simple-talk\/wp-content\/uploads\/2021\/01\/word-image-24.png\" \/><\/p>\n<p><strong>Figure 2: PFS updates<\/strong><\/p>\n<p>You can see that Microsoft SQL Server must update the PFS page several times (line 2, 46, 73, &#8230;). This is because the PFS page &#8211; only in the case of heaps \u2013 needs to be updated every time the next threshold is reached.<\/p>\n<h3>Bottleneck PFS<\/h3>\n<p>The PFS page \u201ccan\u201d become a bottleneck for a heap if many data records are entered in the heap in the shortest possible time. How often the PFS page has to be updated depends mostly on the data record&#8217;s size to be saved.<\/p>\n<p>This procedure does not apply to clustered indexes since data records in an index must ALWAYS be \u201csorted\u201d into the data volume according to the defined index value. Therefore, the search for a \u201cfree\u201d space is not carried out via the PFS page but via the value of the key attribute!<\/p>\n<p>Microsoft SQL Server must explicitly check after each insert process whether the PFS page needs to be updated or not. If the above result is reduced to processes on the PFS page, the process is easy to recognize.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"432\" class=\"wp-image-89430\" src=\"https:\/\/www.red-gate.com\/simple-talk\/wp-content\/uploads\/2021\/01\/word-image-25.png\" \/><\/p>\n<p><strong>Figure 3: Filtered operations from the log for PFS activity<\/strong><\/p>\n<p>In total &#8211; due to the short data record length &#8211; the PFS page had to be updated 14 times in order to enter 10,000 data records in the heap.<\/p>\n<p>At first glance, that may not seem like a lot &#8211; after all, 10,000 records were entered. However, it can become problematic for the PFS page as soon as more than one process wants to enter data in the table at the same time. To derive &#8211; imprecise due to the limitations of my test system! \u2013 a trend, I had the latches recorded on the PFS page with the help of an extended event session and then processed the above (wrapped in a stored proc) in parallel with a different number of clients.<\/p>\n<pre class=\"theme:ssms2012-simple-talk lang:tsql decode:true\">CREATE OR ALTER PROC dbo.InsertCustomerData\n\t@NumOfRecords INT\nAS\nBEGIN\n\tWHILE @NumOfRecords &gt; 0\n\tBEGIN\n\t\tINSERT INTO dbo.Customers\n\t\tSELECT * FROM dbo.CustomerAddresses\n\t\tWHERE   Id = @NumOfRecords;\n \n\t\tSET @NumOfRecords -= 1;\n\tEND\nEND\nGO\n\nCREATE EVENT SESSION [track pfs contention]\nON SERVER\nADD EVENT sqlserver.latch_suspend_end\n(\n    ACTION(package0.event_sequence)\n    WHERE\n    (\n        sqlserver.database_name = N'demo_db'\n        AND sqlserver.is_system = 0\n        AND mode &gt;= 0\n        AND mode &lt;= 5\n    )\n    AND class = 28\n    AND\n    (\n        -- only check for PFS, GAM, SGAM\n        page_id = 1\n        OR page_id = 2\n        OR page_id = 3\n        OR package0.divides_by_uint64(page_id, 8088)\n        OR package0.divides_by_uint64(page_id, 511232)\n    )\n)\nADD TARGET package0.event_file\n(\n\tSET filename = N'T:\\TraceFiles\\PFS_Contention.xel',\n\t\tMAX_FILE_SIZE = 1024,\n\t\tMAX_ROLLOVER_FILES = 10\n)\nWITH\n(\n    MAX_MEMORY = 4096KB,\n    EVENT_RETENTION_MODE = ALLOW_SINGLE_EVENT_LOSS,\n    MAX_DISPATCH_LATENCY = 5 SECONDS,\n    MAX_EVENT_SIZE = 0KB,\n    MEMORY_PARTITION_MODE = NONE,\n    TRACK_CAUSALITY = OFF,\n    STARTUP_STATE = OFF\n)\nGO\n\n<\/pre>\n<p>I carried out each series of tests five times to compensate for possible deviations. After each workload, the recordings from the extended event has been analysed with the following query:<\/p>\n<pre class=\"lang:tsql theme:ssms2012-simple-talk\">SELECT\tCAST(event_Data AS xml) AS StatementData\nINTO\t#EventData\nFROM\tsys.fn_xe_file_target_read_file\n\t('T:\\TraceFiles\\PFS*.xel', NULL, NULL, NULL);\nGO\nSELECT * FROM #EventData;\nGO\nWITH XE\nAS\n(\n\tSELECT\tStatementData.value('(event\/@timestamp)[1]','datetime') AS [time],\n\t\tStatementData.value('(event\/@name)[1]', 'VARCHAR(128)') AS [Event_name],\n\t\tStatementData.value('(event\/data[@name=\"mode\"]\/text)[1]','VARCHAR(10)') AS [mode],\n\t\tStatementData.value('(event\/data[@name=\"duration\"]\/value)[1]','int') AS [duration],\n\t\tStatementData.value('(event\/data[@name=\"page_type_id\"]\/text)[1]','VARCHAR(64)') AS [page_type]\n\tFROM\t#EventData\n)\nSELECT\tXE.page_type,\n\tCOUNT_BIG(*)\t\tAS\tnum_records,\n\tSUM(XE.duration)\tAS\tsum_duration,\n\tAVG(XE.duration)\tAS\tavg_duration\nFROM\tXE\nGROUP BY\n\tXE.page_type\nGO<\/pre>\n<table>\n<tbody>\n<tr>\n<td>\n<p>Processes<\/p>\n<\/td>\n<td>\n<p>1<\/p>\n<\/td>\n<td>\n<p>2<\/p>\n<\/td>\n<td>\n<p>4<\/p>\n<\/td>\n<td>\n<p>8<\/p>\n<\/td>\n<td>\n<p>16<\/p>\n<\/td>\n<td>\n<p>32<\/p>\n<\/td>\n<td>\n<p>64<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>PFS-Contention<\/p>\n<\/td>\n<td>\n<p>0<\/p>\n<\/td>\n<td>\n<p>1<\/p>\n<\/td>\n<td>\n<p>1<\/p>\n<\/td>\n<td>\n<p>7<\/p>\n<\/td>\n<td>\n<p>7<\/p>\n<\/td>\n<td>\n<p>16<\/p>\n<\/td>\n<td>\n<p>68<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>avg. duration (\u00b5sec)<\/p>\n<\/td>\n<td>\n<p>0<\/p>\n<\/td>\n<td>\n<p>0<\/p>\n<\/td>\n<td>\n<p>27<\/p>\n<\/td>\n<td>\n<p>305<\/p>\n<\/td>\n<td>\n<p>790<\/p>\n<\/td>\n<td>\n<p>1.113<\/p>\n<\/td>\n<td>\n<p>3.446<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>Runtime (sec)<\/p>\n<\/td>\n<td>\n<p>4,28<\/p>\n<\/td>\n<td>\n<p>5,65<\/p>\n<\/td>\n<td>\n<p>7,68<\/p>\n<\/td>\n<td>\n<p>13,45<\/p>\n<\/td>\n<td>\n<p>23,83<\/p>\n<\/td>\n<td>\n<p>55,93<\/p>\n<\/td>\n<td>\n<p>165,72<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>avg (\u00b5sec)\/ row<\/p>\n<\/td>\n<td>\n<p>428<\/p>\n<\/td>\n<td>\n<p>2.825<\/p>\n<\/td>\n<td>\n<p>192<\/p>\n<\/td>\n<td>\n<p>16.813<\/p>\n<\/td>\n<td>\n<p>16.769<\/p>\n<\/td>\n<td>\n<p>17.478<\/p>\n<\/td>\n<td>\n<p>25.894<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"738\" height=\"417\" class=\"wp-image-89431\" src=\"https:\/\/www.red-gate.com\/simple-talk\/wp-content\/uploads\/2021\/01\/word-image-26.png\" \/><\/p>\n<p><strong>Figure 4: Dependence on processes to contention(s)<\/strong><\/p>\n<p>The tests I carried out are not representative because external influences were not properly isolated. Nevertheless, one can deduce from the values that the potential for contention on the PFS page escalates with an increasing number of simultaneous processes.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"671\" class=\"wp-image-89432\" src=\"https:\/\/www.red-gate.com\/simple-talk\/wp-content\/uploads\/2021\/01\/word-image-27.png\" \/><\/p>\n<p>You know the problem from everyday life; You have to queue longer the more people want to use the same resource (till in the supermarket) at the same time. The bottleneck can be rectified by working with multiple files for the filegroup in which the heap is located &#8211; as is also common practice with TEMPDB.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"188\" class=\"wp-image-89433\" src=\"https:\/\/www.red-gate.com\/simple-talk\/wp-content\/uploads\/2021\/01\/word-image-28.png\" \/><\/p>\n<p><strong>Figure 5: A separate database file for each core<\/strong><\/p>\n<p>I performed the same workload with 4 database files for the PRIMARY filegroup, and the results have been observed with the Windows Resource Manager:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"171\" class=\"wp-image-89434\" src=\"https:\/\/www.red-gate.com\/simple-talk\/wp-content\/uploads\/2021\/01\/word-image-29.png\" \/><\/p>\n<p><strong>Figure 6: Relatively even distribution of the write load &#8211; better throughput<\/strong><\/p>\n<p>BTW: Now it is a good time to learn \u201ca few\u201d german words like<\/p>\n<p>Datei = File<\/p>\n<p>Lesen = read<\/p>\n<p>Schreiben = write<\/p>\n<p>It was to be expected that this would ease the situation. You can think of it as a situation in a supermarket where only one till is open at first. As soon as there are many customers in the supermarket, it accumulates in front of the till. Several cash registers are opened, and the situation is more relaxed again.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"738\" height=\"417\" class=\"wp-image-89435\" src=\"https:\/\/www.red-gate.com\/simple-talk\/wp-content\/uploads\/2021\/01\/word-image-30.png\" \/><\/p>\n<p><strong>Figure 7: Significant relaxation for the PFS pages<\/strong><\/p>\n<h2>Bottleneck data structure<\/h2>\n<p>Anyone working with heaps must take the data structures into account. The biggest difference in the storage of data between an index and a heap is that data in a heap can be stored anywhere, while indexed tables must store the data according to the index attribute&#8217;s value. Storing data in a Heap can result in several problems:<\/p>\n<p>Waste of storage space due to the calculation of the percentage of available storage space on a data page<\/p>\n<p>Waste of memory in the buffer pool, since it is not the data itself that is loaded into the buffer pool, but the data pages on which the data is located<\/p>\n<p>Increased contention on the PFS page if data records are too large and the percentage filling level has to be updated quickly.<\/p>\n<h3>Unused memory on a data page<\/h3>\n<p>Memory is expensive and, for Microsoft SQL Server, it\u2019s an important component for fast queries. For this reason, you naturally want to avoid the situation where data pages are not completely filled, and thus RAM cannot be used.<\/p>\n<p>To demonstrate this huge discrepancy between a Heap and a Clustered Index, create in the first scenario, a Heap table with a column C2 with a fixed size of 2,000 bytes for the payload. Afterwards, a Stored Procedure inserts 10,000 rows into the Heap table.<\/p>\n<pre class=\"lang:tsql theme:ssms2012-simple-talk\">USE demo_db;\nGO\nDROP TABLE IF EXISTS dbo.Customers;\nGO\n-- Create a demo table\nCREATE TABLE dbo.Customers\n(\n   C1 INT\t\tNOT NULL\tIDENTITY (1, 1),\n   C2 CHAR(2000)\tNOT NULL\tDEFAULT ('Testdata'),\n);\nGO\n-- Create stored procedure for the INSERT process\nCREATE OR ALTER PROC dbo.InsertCustomerData\n\t@NumOfRecords INT\nAS\nBEGIN\n\tWHILE @NumOfRecords &gt; 0\n\tBEGIN\n\t\tINSERT INTO dbo.Customers\n\t\t(C2)\n\t\tDEFAULT VALUES;\n\t\t\n\t\tSET @NumOfRecords -= 1;\n\tEND\nEND\nGO\n-- Execution of stored procedures for 10,000 rows\nEXEC dbo.InsertCustomerData @NumOfRecords = 10000;\nGO<\/pre>\n<p>The example above creates the table [dbo].[Customers] and a simple Stored Procedure which gets the number of rows to be inserted from a variable. After the insert process, you can get insights into the data distribution with the next query, which retrieves the physical information about the stored data.<\/p>\n<pre class=\"lang:tsql theme:ssms2012-simple-talk\">SELECT\tpage_count,\n\trecord_count,\n\trecord_count \/ page_count\tAS\tavg_rows_per_page,\n\tavg_page_space_used_in_percent\nFROM\tsys.dm_db_index_physical_stats\n\t(\n\tDB_ID(),\n\tOBJECT_ID(N'dbo.Customers', N'U'),\n\tNULL,\n\tNULL,\n\tN'DETAILED'\n\t)\nWHERE\tindex_level = 0;\nGO<\/pre>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"1312\" height=\"100\" class=\"wp-image-89436\" src=\"https:\/\/www.red-gate.com\/simple-talk\/wp-content\/uploads\/2021\/01\/word-image-31.png\" \/><\/p>\n<p>With the table&#8217;s current design, two or three records (avg) can be stored on one data page. This means that a data page is filled with approx. 50 \u2013 75%. If you change the Heap Table to a Clustered Index Table, the results look completely different!<\/p>\n<pre class=\"lang:tsql theme:ssms2012-simple-talk\">DROP TABLE IF EXISTS dbo.Customers;\nGO\n-- Create a demo table\nCREATE TABLE dbo.Customers\n(\n  C1  INT\t\tNOT NULL\tIDENTITY (1, 1),\n  C2  CHAR(2000)\tNOT NULL\tDEFAULT ('Testdata'),\n  CONSTRAINT pk_Customers_C1 PRIMARY KEY CLUSTERED (C1)\n);\nGO\n-- Execution of stored procedures for 10,000 rows\nEXEC dbo.InsertCustomerData @NumOfRecords = 10000;\nGO<\/pre>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"1305\" height=\"100\" class=\"wp-image-89437\" src=\"https:\/\/www.red-gate.com\/simple-talk\/wp-content\/uploads\/2021\/01\/word-image-32.png\" \/><\/p>\n<p>The reason for this odd behaviour is that Microsoft SQL Server references ONLY to the PFS page when it comes to the storage of a record in a Heap while a Clustered Index always has to follow the restriction of the Clustered Key and stores the record on the position of the key in the table.<\/p>\n<p>A clustered index outperforms \u2013 based on the storage consumption \u2013 the Heap due to the need to store a record based on the key attribute. But keep in mind that \u2013 different from a Heap structure \u2013 the INSERT process requires to follow the B-Tree structure when it must safe a record on a data page.<\/p>\n<p>Note<\/p>\n<p>Before you go for a Heap structure, perform some tests to understand your data distribution in the data pages!<\/p>\n<h3>Workload when inserting records<\/h3>\n<p>The following demonstration shows the dependencies between the row size and the remaining free space on a data page.<\/p>\n<pre class=\"lang:tsql theme:ssms2012-simple-talk \">IF OBJECT_ID(N'dbo.demo_table', N'U') IS NOT NULL\n\tDROP TABLE dbo.demo_table;\n\tGO\n-- The size of the column C1 will change with every test!\nCREATE TABLE dbo.demo_table (C1 CHAR(100) NOT NULL);\nGO\n-- Clear the log file for the analysis of PFS updates\nCHECKPOINT;\nGO\n-- This script will run for each test loop and insert \n-- 10,000 records into the table\nBEGIN TRANSACTION InsertRecord;\nGO\n\tDECLARE @I INT = 1;\n\tWHILE @I &lt;= 10000\n\tBEGIN\n\t    INSERT INTO dbo.demo_table(C1) VALUES ('This is a test');\n\t    SET @I += 1;\n\tEND\n\t-- Afterwards we count the log entries for the PFS updates\n\tSELECT\tContext,\n\t\tCOUNT_BIG(*)\n\tFROM\tsys.fn_dblog(NULL, NULL)\n\tWHERE\t[Transaction ID] IN \n\t\t(\n\t\t\tSELECT [Transaction ID]\n\t\t\tFROM sys.fn_dblog(NULL, NULL)\n\t\t\tWHERE\t[Transaction Name] = N'InsertRecord'\n\t\t\t\tOR Context = N'LCX_PFS'\n\t\t)\n\tGROUP BY\n\t\tContext;\n\t-- and have a look to the avg space used in the heap\n\tSELECT\tpage_count,\n\t\tavg_page_space_used_in_percent\n\tFROM\tsys.dm_db_index_physical_stats\n\t\t(\n\t\t\tDB_ID(),\n\t\t\tOBJECT_ID(N'dbo.demo_table', N'U'),\n\t\t\t0,\n\t\t\tNULL,\n\t\t\tN'DETAILED'\n\t\t);\n\tGO\nROLLBACK TRANSACTION;\nGO<\/pre>\n<p>The above demonstration has been run with different row sizes. The result of the tests with different row sizes gave the following results:<\/p>\n<p>While the duration of the transaction runtime changes moderately (157 ms &#8211; 1.459 ms), the number of updates of the PFS page increases extremely beginning with a record length of 200 bytes (563 &#8211; 16.260). Although the PFS page refresh occurs quite frequently, the number of data pages grows moderately (149-5,000). The average filling level of a data page is between 75% and 100%, depending on the size of the row.<\/p>\n<p>The PFS page&#8217;s frequent updating is explained by the growing size of a data record since fewer data records fit on one data page and the various thresholds can be reached more quickly.<\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<p>Record Length<\/p>\n<\/td>\n<td>\n<p>Time (ms)<\/p>\n<\/td>\n<td>\n<p>PFS Update<\/p>\n<\/td>\n<td>\n<p>Pages<\/p>\n<\/td>\n<td>\n<p>Avg. Used space<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>100<\/p>\n<\/td>\n<td>\n<p>157<\/p>\n<\/td>\n<td>\n<p>563<\/p>\n<\/td>\n<td>\n<p>149<\/p>\n<\/td>\n<td>\n<p>90,36%<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>200<\/p>\n<\/td>\n<td>\n<p>414<\/p>\n<\/td>\n<td>\n<p>1,397<\/p>\n<\/td>\n<td>\n<p>271<\/p>\n<\/td>\n<td>\n<p>95,26%<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>500<\/p>\n<\/td>\n<td>\n<p>441<\/p>\n<\/td>\n<td>\n<p>3,199<\/p>\n<\/td>\n<td>\n<p>777<\/p>\n<\/td>\n<td>\n<p>80,91%<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>1000<\/p>\n<\/td>\n<td>\n<p>595<\/p>\n<\/td>\n<td>\n<p>5,916<\/p>\n<\/td>\n<td>\n<p>1,436<\/p>\n<\/td>\n<td>\n<p>86,79%<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>2000<\/p>\n<\/td>\n<td>\n<p>920<\/p>\n<\/td>\n<td>\n<p>10,625<\/p>\n<\/td>\n<td>\n<p>3,339<\/p>\n<\/td>\n<td>\n<p>74,31%<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>3000<\/p>\n<\/td>\n<td>\n<p>1,138<\/p>\n<\/td>\n<td>\n<p>16,069<\/p>\n<\/td>\n<td>\n<p>5,004<\/p>\n<\/td>\n<td>\n<p>74,27%<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>4000<\/p>\n<\/td>\n<td>\n<p>1,459<\/p>\n<\/td>\n<td>\n<p>16,260<\/p>\n<\/td>\n<td>\n<p>5,000<\/p>\n<\/td>\n<td>\n<p>99,04%<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Let\u2019s do a little maths when data are stored on a data page.<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<p>Bytes<\/p>\n<\/th>\n<th>\n<p>50%<\/p>\n<\/th>\n<th>\n<p>80%<\/p>\n<\/th>\n<th>\n<p>95%<\/p>\n<\/th>\n<th>\n<p>100%<\/p>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<p>100<\/p>\n<\/td>\n<td>\n<p><strong>40<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>64<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>76<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>80<\/strong><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>200<\/p>\n<\/td>\n<td>\n<p><strong>20<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>32<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>38<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>40<\/strong><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>500<\/p>\n<\/td>\n<td>\n<p><strong>8<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>12<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>15<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>16<\/strong><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>1000<\/p>\n<\/td>\n<td>\n<p><strong>4<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>6<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>7<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>8<\/strong><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>2000<\/p>\n<\/td>\n<td>\n<p><strong>2<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>3<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>3<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>4<\/strong><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>3000<\/p>\n<\/td>\n<td>\n<p><strong>1<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>2<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>2<\/strong><\/p>\n<\/td>\n<td>\n<p><strong>2<\/strong><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<p>4000<\/p>\n<\/td>\n<td>\n<p><strong>1<\/strong><\/p>\n<\/td>\n<td>\u00a0<\/td>\n<td>\u00a0<\/td>\n<td>\n<p><strong>2<\/strong><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>The above table shows the maximum records which \u201cshould\u201d fit on ONE data page when the threshold has exceeded. Please note that with a fill level of 95%, only <strong>403 bytes<\/strong> (8.060 * (1-95%)) are mathematically available on the data page.<\/p>\n<p>If the row size is 100 Bytes, Microsoft SQL Server can store 40 records on ONE data page before the threshold gets updated to 80%. It takes 24 more records before the next update to 95% will happen.<\/p>\n<p>As bigger the row size is as faster will the thresholds be reached. Keep in mind that the row size has an direct impact on the possible contention on the PFS page.<\/p>\n<p>Let\u2019s take a row size of 1,000 bytes for a record. With the 5<sup>th<\/sup> record, the PFS gets updated to 80%. When the 6<sup>th<\/sup> row (1,000 Bytes) must be stored on a data page, it will fit perfectly. From the table above, you can see the green and red values.<\/p>\n<p>The green values mean that the records can be stored on the data page while the red ones show the records which will request a new data page!<\/p>\n<h2>Summary<\/h2>\n<p>The aim when inserting new data in a heap is to avoid frequent updates of the PFS pages and to use the available space as max as possible. The next article will show how you can boost the performance when you insert data into a Heap.<\/p>\n\n\n<section id=\"faq\" class=\"faq-block my-5xl\">\n    <h2>FAQs: Heaps in SQL Server: Part 4 PFS contention<\/h2>\n\n                        <h3 class=\"mt-4xl\">1. What is PFS contention in SQL Server heaps?<\/h3>\n            <div class=\"faq-answer\">\n                <p>PFS (Page Free Space) contention occurs when many concurrent sessions are inserting rows into a heap table and all need to update the same PFS page simultaneously. Each INSERT into a heap that requires a new page must update the PFS tracking page for that extent. The PFS page update is protected by a latch; under high concurrency, sessions queue on this latch, serialising what should be parallel inserts. The symptom is PAGELATCH_UP or PAGELATCH_EX waits visible in sys.dm_os_wait_stats.<\/p>\n            <\/div>\n                    <h3 class=\"mt-4xl\">2. How do I diagnose PFS contention on a SQL Server heap?<\/h3>\n            <div class=\"faq-answer\">\n                <p>Query sys.dm_os_wait_stats for PAGELATCH_UP or PAGELATCH_EX waits. If these are high and the bottlenecked object is a heap table (visible in sys.dm_exec_requests or Extended Events), PFS contention is likely. You can confirm by capturing the page latch resource description &#8211; it will identify the specific PFS page being contended. The demo in this article shows the contention signature using a concurrent INSERT test on a heap.<\/p>\n            <\/div>\n                    <h3 class=\"mt-4xl\">3. How do I fix SQL Server heap PFS contention?<\/h3>\n            <div class=\"faq-answer\">\n                <p>The most effective approaches are: (1) Convert the heap to a clustered index &#8211; indexed tables use GAM\/SGAM pages for allocation rather than PFS-heavy heap allocation; (2) Increase the row size to reduce how many rows fit per page and therefore reduce PFS update frequency; (3) Pre-allocate pages using trace flag 1118 (SQL Server 2012 and earlier) to reduce mixed-extent allocation. For INSERT-heavy workloads, a clustered index with a monotonically increasing key (identity, sequence) is the standard recommendation.<\/p>\n            <\/div>\n                    <h3 class=\"mt-4xl\">4. When does SQL Server use a heap versus a clustered index for storage?<\/h3>\n            <div class=\"faq-answer\">\n                <p>SQL Server stores data in a heap when a table has no clustered index. Heaps are sometimes used intentionally for staging tables, bulk load landing tables, or tables with extremely high INSERT rates where clustered index key contention would be worse than PFS contention. Clustered indexes are the default recommendation for most production tables because they provide ordered storage, efficient range scans, and avoid heap allocation bottlenecks like PFS contention and forwarding record overhead.<\/p>\n            <\/div>\n            <\/section>\n","protected":false},"excerpt":{"rendered":"<p>PFS (Page Free Space) contention is a heap INSERT bottleneck in SQL Server that occurs during high-concurrency workloads. Learn how PFS updates trigger page allocation, why contention forms, and how to structure heap inserts to minimize it. Part 4 of the Heaps series.&hellip;<\/p>\n","protected":false},"author":205038,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":true,"footnotes":""},"categories":[53,143531],"tags":[101935,126398,4206,126397,5842,4151],"coauthors":[19794],"class_list":["post-89427","post","type-post","status-publish","format-standard","hentry","category-featured","category-t-sql-programming-sql-server","tag-heaps","tag-inserting-rows","tag-performance","tag-pfs-contention","tag-sql-monitor","tag-sql-server"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.red-gate.com\/simple-talk\/wp-json\/wp\/v2\/posts\/89427","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.red-gate.com\/simple-talk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.red-gate.com\/simple-talk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.red-gate.com\/simple-talk\/wp-json\/wp\/v2\/users\/205038"}],"replies":[{"embeddable":true,"href":"https:\/\/www.red-gate.com\/simple-talk\/wp-json\/wp\/v2\/comments?post=89427"}],"version-history":[{"count":8,"href":"https:\/\/www.red-gate.com\/simple-talk\/wp-json\/wp\/v2\/posts\/89427\/revisions"}],"predecessor-version":[{"id":109618,"href":"https:\/\/www.red-gate.com\/simple-talk\/wp-json\/wp\/v2\/posts\/89427\/revisions\/109618"}],"wp:attachment":[{"href":"https:\/\/www.red-gate.com\/simple-talk\/wp-json\/wp\/v2\/media?parent=89427"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.red-gate.com\/simple-talk\/wp-json\/wp\/v2\/categories?post=89427"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.red-gate.com\/simple-talk\/wp-json\/wp\/v2\/tags?post=89427"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.red-gate.com\/simple-talk\/wp-json\/wp\/v2\/coauthors?post=89427"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}