Five SQL Server changes I'd love to see (DROP_EXISTING, ALTER TABLE, STRING_SPLIT fixes)

Q: What does the DROP_EXISTING = ON error mean in SQL Server?

DROP_EXISTING = ON requires the index to already exist on the table. If the index is not present (for example on a fresh deployment to a new environment), SQL Server returns ‘Msg 7999 – Could not find any index named X for table Y’. The opposite case is also a problem: if the index exists and DROP_EXISTING is OFF, SQL Server returns ‘Msg 1913 – The operation failed because an index or statistics with name X already exists’. There is currently no CREATE OR ALTER equivalent for indexes – developers have to either drop-then-create, wrap in IF EXISTS, or manage the script with external tooling.

Q: Why is ALTER TABLE DROP COLUMN inconsistent in SQL Server?

ALTER TABLE supports DROP COLUMN but does not support a clean ‘ADD COLUMN IF NOT EXISTS’ variant. To make a column addition idempotent, you have to wrap the ALTER in an IF NOT EXISTS check against sys.columns or use dynamic SQL. DROP COLUMN on a non-existent column similarly requires a wrapper check. This asymmetry makes deployment scripts more verbose than they need to be and is a common source of developer frustration.

Q: Why does STRING_SPLIT require compatibility level 130 in SQL Server?

STRING_SPLIT was introduced in SQL Server 2016 at compatibility level 130. Databases still running on compatibility level 100 or 110 – which is common for migrated legacy databases – cannot use STRING_SPLIT without a database-level compatibility change. Raising compatibility level can have other query plan side effects that DBAs want to control independently, so the two concerns become tangled. Many developers would prefer STRING_SPLIT to be available at any compatibility level on SQL Server 2016+ instance.

Q: Can you use GROUP BY alias at the same scope in SQL Server?

No. In SQL Server, an alias defined in the SELECT list cannot be referenced in the same query’s GROUP BY, HAVING, or WHERE clauses – the alias is only visible in ORDER BY. You have to either repeat the expression or wrap the query in a subquery or CTE. Oracle allows SELECT-list aliases in GROUP BY at the same scope, which is the behaviour many SQL Server developers would like to see adopted.

In this article, SQL Server veteran Aaron Bertrand lists five SQL Server behaviors that trip developers up, and that he would like Microsoft to fix. Each section names the specific pain point with a working code example of the current behavior.

Five SQL Server changes I’d love to see

I’ve had lots of wish lists over the years, from Ladybug to Connect to UserVoice, and very few of my dreams ever come true. Still, it’s fun every once in a while, to be whimsical about the way I wished SQL Server worked. Especially in January because, you know, new beginnings.

So here are a few ideas that have crossed my mind recently:

DROP_EXISTING

I am constantly fiddling with indexes on our biggest database. It sometimes takes me 4 or 5 iterations on an index definition before I hit the sweet spot for the queries I’m targeting. If I try to create an index from a script, say, that’s in source control after a deployment to a different environment:

1	CREATE INDEX x ON dbo.foo(bar) WITH (DROP_EXISTING = ON);

Result:

Msg 7999, Level 16, State 9, Line 6
Could not find any index named 'x' for table 'dbo.foo'.

If I change the script to (DROP_EXISTING = OFF), I can create the index. But then if I want to tweak it, say, add an INCLUDE column or a filter, unless I remember to change it back to ON, now the result is:

Msg 1913, Level 16, State 1, Line 6
The operation failed because an index or statistics with name 'x' already exists on table 'dbo.foo'.

I want more convenience here, like CREATE OR ALTER PROCEDURE affords us. Instead, I have to make the script idempotent manually, e.g.

IF EXISTS (SELECT 1 FROM sys.indexes WHERE …)

CREATE INDEX … WITH (DROP_EXISTING = ON);

ELSE

CREATE INDEX … ;

That’s bulky and hard to maintain and requires you to repeat all the other properties of the index (or use dynamic SQL).. It sure would be nice to have idempotent syntax for the first and subsequent times I create a given index, like:

1	CREATE INDEX … WITH (DROP_EXISTING = IF_EXISTS);

Or even:

1	CREATE INDEX … WITH (DROP_IF_EXISTS = ON);

This would mean all of my index scripts could be structured the same!

ALTER TABLE

In a recent discussion, I was reminded that this inconsistency between adding and dropping a column still exists:

1 2	ALTER TABLE ADD <column name> …; ALTER TABLE DROP COLUMN <column name> …;

Would it kill them to make the column keyword optional?

1 2	ALTER TABLE ADD [COLUMN] <column name> …; ALTER TABLE DROP [COLUMN] <column name> …;

This way, whether you like the explicit or implicit variation, your scripts can be congruent. I think this would reduce confusion and friction for new users who can’t always get on board with the notion of “it is this way simply because it’s always been that way.” I don’t like that reason either.

STRING_SPLIT

I really don’t like that functions like STRING_SPLIT() require you use the130 compatibility level, since some holdouts could still be using 100 on SQL Server 2022 databases. Since we’re now in an era where we will keep old compat levels around forever, it would be nice to have a system function that just wrapped STRING_SPLIT().

So, you could have master in the right compatibility level and just call master.sys.string_split() and the built-in function would run in that context. We could create our own, of course, but only some of us would, and we’d all implement our versions slightly differently. And it’s amazing how many people just refuse to create their own objects for anything (see numbers and calendar tables).

Along those lines, it would be great to have wrapper functions around GENERATE_SERIES(), too. Maybe that could even translate to and from JSON or CSV.

And a generic TVP built-in so that you don’t have to create your own TVPs in every database. For example, I’d love to just have a stored procedure parameter that automatically presents as a table of int or bigint values, etc.

CREATE PROCEDURE dbo.myprocedure

@IDList sys.bigints READONLY

SELECT Id, Title

FROM Posts

WHERE Id IN (SELECT value FROM sys.bigints);

(Likewise, it would be cool if we could ditch the required READONLY keyword here, too; in general, but especially for system types.)

Fast, reliable and consistent SQL Server development…

…with SQL Toolbelt Essentials. 10 ingeniously simple tools for accelerating development, reducing risk, and standardizing workflows.

Learn more & try for free

Better statement scoping

In a recent post, I talked about how Oracle now allows you to reference GROUP BY aliases at the same scope, which would be a powerful thing to add to SQL Server (but don’t hold your breath). Itzik Ben-Gan talked about the new WINDOW clause in SQL Server 2022, which allows you to reference the same window specifications more than once without having to repeat all of the logic.

I’d love the same thing for the output of window functions, like, say:

SELECT SomethingID,

rownum = ROW_NUMBER() OVER (...),

maxdate = MAX(date) OVER (...)

FROM dbo.Something<br> WHERE rownum <= 10 AND maxdate < '20240101';

It’s cumbersome to have to create a CTE or subquery just to avoid repeating expressions… especially when the queries are much more complex than this simple example.

Selective skip rollback

And finally, I wish you could declare a table as ephemeral or a database or DML statement as “bypass_transaction_log” so that the logging you do (for example, within a trigger) could survive a rollback.

Currently, the workaround is to write your logging to a table variable, then roll back, then transfer the data from the table variable to a permanent log table. This is messy and causes maintenance issues (for example when the columns you’re logging change types). It also encourages people to use hacky ways to log data so that it can’t be rolled back or develop more cumbersome ways to implement their own equivalent to transactions.

And you?

What are some of your wish list items for how SQL Server should work?

Simple Talk is brought to you by Redgate Software

Take control of your databases with the trusted Database DevOps solutions provider. Automate with confidence, scale securely, and unlock growth through AI.

Discover how Redgate can help you

FAQs

1. What does the DROP_EXISTING = ON error mean in SQL Server?

DROP_EXISTING = ON requires the index to already exist on the table. If the index is not present (for example on a fresh deployment to a new environment), SQL Server returns ‘Msg 7999 – Could not find any index named X for table Y’. The opposite case is also a problem: if the index exists and DROP_EXISTING is OFF, SQL Server returns ‘Msg 1913 – The operation failed because an index or statistics with name X already exists’. There is currently no CREATE OR ALTER equivalent for indexes – developers have to either drop-then-create, wrap in IF EXISTS, or manage the script with external tooling.

2. Why is ALTER TABLE DROP COLUMN inconsistent in SQL Server?

ALTER TABLE supports DROP COLUMN but does not support a clean ‘ADD COLUMN IF NOT EXISTS’ variant. To make a column addition idempotent, you have to wrap the ALTER in an IF NOT EXISTS check against sys.columns or use dynamic SQL. DROP COLUMN on a non-existent column similarly requires a wrapper check. This asymmetry makes deployment scripts more verbose than they need to be and is a common source of developer frustration.

3. Why does STRING_SPLIT require compatibility level 130 in SQL Server?

STRING_SPLIT was introduced in SQL Server 2016 at compatibility level 130. Databases still running on compatibility level 100 or 110 – which is common for migrated legacy databases – cannot use STRING_SPLIT without a database-level compatibility change. Raising compatibility level can have other query plan side effects that DBAs want to control independently, so the two concerns become tangled. Many developers would prefer STRING_SPLIT to be available at any compatibility level on SQL Server 2016+ instance.

4. Can you use GROUP BY alias at the same scope in SQL Server?

No. In SQL Server, an alias defined in the SELECT list cannot be referenced in the same query’s GROUP BY, HAVING, or WHERE clauses – the alias is only visible in ORDER BY. You have to either repeat the expression or wrap the query in a subquery or CTE. Oracle allows SELECT-list aliases in GROUP BY at the same scope, which is the behaviour many SQL Server developers would like to see adopted.

This document contains proprietary information and is protected by copyright law.

Register for Simple Talk

Five SQL Server changes I’d love to see (DROP_EXISTING, ALTER TABLE, STRING_SPLIT fixes)

Five SQL Server changes I’d love to see

DROP_EXISTING

ALTER TABLE

STRING_SPLIT

Fast, reliable and consistent SQL Server development…

Better statement scoping

Selective skip rollback

And you?

Simple Talk is brought to you by Redgate Software

FAQs

1. What does the DROP_EXISTING = ON error mean in SQL Server?

2. Why is ALTER TABLE DROP COLUMN inconsistent in SQL Server?

3. Why does STRING_SPLIT require compatibility level 130 in SQL Server?

4. Can you use GROUP BY alias at the same scope in SQL Server?

Article tags

About the author

How to optimize planned availability group failovers in SQL Server

How to fix the Invalid Length error in SQL Server

How to fix the Invalid Object Name error in SQL Server

Five SQL Server changes I’d love to see (DROP_EXISTING, ALTER TABLE, STRING_SPLIT fixes)

Five SQL Server changes I’d love to see

DROP_EXISTING

ALTER TABLE

STRING_SPLIT

Fast, reliable and consistent SQL Server development…

Better statement scoping

Selective skip rollback

And you?

Simple Talk is brought to you by Redgate Software

FAQs

1. What does the DROP_EXISTING = ON error mean in SQL Server?

2. Why is ALTER TABLE DROP COLUMN inconsistent in SQL Server?

3. Why does STRING_SPLIT require compatibility level 130 in SQL Server?

4. Can you use GROUP BY alias at the same scope in SQL Server?

Article tags

Recommended

Subscribe for more

About the author

How to optimize planned availability group failovers in SQL Server

How to fix the Invalid Length error in SQL Server

How to fix the Invalid Object Name error in SQL Server