15 February 2021

Guest post

This is a guest post from Phil Factor. Phil Factor (real name withheld to protect the guilty), aka Database Mole, has 30 years of experience with database-intensive applications.

Despite having once been shouted at by a furious Bill Gates at an exhibition in the early 1980s, he has remained resolutely anonymous throughout his career.

He is a regular contributor to Simple Talk and SQLServerCentral.

Phil Factor

15 February 2021

Getting Data In and Out of SQL Server Flyway Builds

This article provides SQL "bulk copy" routines to extract data from and load it into a SQL Server database, rapidly, and demonstrates a way to automatically bulk insert the test data in a Flyway build, using a SQL callback.

Guest post

This article is part of a series on managing test data in a Flyway development:

Also relevant are the series of related articles on Database testing with Flyway.

I’ll start by briefly reviewing a strategy for storing datasets that maintains separation of data and metadata, and then get into the practicalities of bulk import and export of data, during Flyway development. Row-by-row inserts are fine for smaller data volumes, but are not generally fast enough for a build-and-fill operation. If you want to do ‘builds-while-you-wait’, then you’ll need bulk inserts, unless you want to wait hours. Finally, it makes the build script files much shorter.

I’ll assume you have a SQL Server database at the right version, stocked with the data you need, and I’ll demonstrate SQL routines to extract and load the data, using BCP, and then provide a SQL callback script that will automatically bulk insert the test data as soon as Flyway successfully builds the database.

The separation of data and metadata

When you are building a database of any size, you’ll probably have a policy of separating data from metadata. It is a lot easier, and it gives you more flexibility to build the metadata to the correct level and then insert the data. There are several reasons for this. For a start, you shouldn’t store data, other than the barest essential of enumeration data, in database source control: it is fine for data that requires change control and a new version of the database, but that is a rare event.

Data should almost never be handled in the same way as a structural change. Data is under a very different type of control and has entirely separate repercussions for auditing. It should not be in source control, because it may contain personal or sensitive information that must be curated in a very different way to the application source. It will probably need redaction or masking before it can be used in development. If you have data that is part of the database, then the table that contains it is best handled as a view with the literal data rendered in multi-line VALUE statements so that it is read-only and cannot be changed accidentally.

Storing datasets

The management of data sets will require a bit of thought. In this example, I store them on the SQL Server hosting the source database. The directory is structured by project name, usually the name of the database. By storing them on the server, it means that you don’t have to give the SQL Server login any network access. You can, if you need to, copy the data sets to your project folders stored somewhere on the local area network, as a separate task. However, I’d advise instead that you to leave them on the server hosting the SQL Server and share them to a restricted list of users.

Maintaining datasets per database version

To demonstrate this in Flyway, we need to be aware that a user might need any of the current versions of the database. These are likely to have different ways of arranging the data, particularly in the early stages of development. We’ll therefore save the table data in a different directory for each change to the tables, such as a column change. We’ll copy the Flyway convention and save each dataset in a directory whose name is the starting version number that is appropriate. The same dataset may well be appropriate for several versions, so when we select a dataset, we choose the most recent version before or equal to the version of the database we need to build. By doing this we use the minimum necessary disk space for our data sets with the smallest amount of duplication.

To do this, we need to compare standard version numbers. If your language supports the comparison of arrays, the comparison of standard version numbers is trivial. We need to do this in SQL for this exercise, so it isn’t quite so easy.

Multiple datasets at the same version

You may need several datasets. If you are doing testing, you’ll probably have a standard dataset that allows you to compare the result of a process with a version that has been verified as being correct. You might also want a data set with the largest likely predicted volume of data to check scalability. You might also want a small dataset for a quick test cycle of basic function. This can be handled by Flyway, but is out of the scope of this article, because so much depends on the way you store your development data.

Writing data out and reading data in

You will need two routines. They should not be part of the database but held separately as deployment scripts. The first one will extract the data from a database at the correct version, using BCP. You run this batch ad-hoc, as required. With SQL Server, you will probably want to choose native Unicode format to store your data.

The second routine loads the data into an empty database build, of the same version. It will be in a Flyway SQL callback, executed when the migration ends. It will need to know what version the migration got to and then access the corresponding dataset. Our data folder will need a dataset for every table change, so that each subdirectory holds a dataset that is appropriate for one or more revisions, until superseded by a higher version.

Extract the data from the source database using BCP

The first SQL script is to extract data from the ‘donor’ or source database. To output the contents of every table of a source database, as a series of ‘wide’ native BCP files, is pretty trivial in a development environment. You might have used SQL Data Generator to create data, or you could have used SQL Data Masker. If you are using unmasked ‘production’ data, the task becomes less trivial and requires controls.

You will need a login that is powerful enough to have write-access to the file system of the server that hosts SQL Server. The login must also be able to temporarily allow SQL Server to execute DOS commands. This is relatively simple in code if you are a system admin (sa). If you haven’t got this sort of superpower, then you’ll have to get a DBA to set up a task to do this, perhaps on SQL Server Agent.

Here is the script. The @CurrentDatabase should be a copy of the database from which you wish to write out the data. I’m using AdventureWorks2016. The data folder location will be a subdirectory of ‘C:\BCPData’, called <dbname>\databaseContents\VersionNumber, where <dbname> is the value assigned to @SourceDatabase, which is the project folder name for this database. In this example, the @SourceDatabase is just taken from the current database, AdventureWorks2016, but in most cases you would want to use a more generic project name such as AdventureWorks.

The script will first try to grab the version number of that database from the flyway_schema_history table, if it’s a Flyway-controlled database. If there is no flyway history table, it will then check for a version stored in an extended property. If neither exist, it just assigns a version number of your choosing and which you need to add to the code.

Note that if you are getting data from a Flyway database, you need to prevent it copying out the data from the Flyway schema history table, because you really don’t want it copied into the target database(s) This may not be in dbo so you need to replace that with its chosen schema. If you execute this within Flyway you can, of course, use the placeholder – ${flyway:defaultSchema}.

/* It is important to declare your destination for the data. This is slightly

complicated by the fact that you may want to take data from slightly differently

named databases from the 'project' name for the database. Like 'Adventureworks' might

be your 'datasource' name and you might take data from 'currentDatabase'

Adventureworks2016.

If you want to add a directory with the name of the datasource database

(the data donor) then add the macro <dbname>. The application will swap that

out for the Data source name. The system obligingly tries to create the directory

if it doesn't exist so with great power comes a certain level of responsibility:

Well, it should do. */

--this first line must be changed to suit your setup

DECLARE @DataFolderLocation sysname = 'C:\BCPData\<dbname>databaseContents';

DECLARE @CurrentDatabase sysname=Db_Name();--this database. eg PubsTest

--this name must be the name of the project database.

Declare @DataSource sysname=Db_Name();--the project name of the database e.g. Pubs

--automatically substitute the current database name

Select @DataFolderLocation = replace(@DataFolderLocation,'<dbname>',@DataSource+'\')

-- you may occasionally have a database of a different name with the data.

-- To allow advanced options to be changed.

EXEC sp_configure 'show advanced options', 1;

-- To update the currently configured value for advanced options.

RECONFIGURE WITH OVERRIDE;

-- To enable the feature.

EXEC sp_configure 'xp_cmdshell', 1;

-- To update the currently configured value for this feature.

RECONFIGURE WITH OVERRIDE;

Declare @Version Nvarchar(40)

/* first we read the flyway schema history to detect what version

the database needs to be at. */

if object_id('dbo.flyway_schema_History') is not null --is it a flyway database

Begin

SELECT @Version=[version] --we need to find the greatest successful version.

FROM dbo.flyway_schema_History --

WHERE installed_rank =

(--get the PK of the highest successful version recorded

SELECT Max(Installed_Rank)

FROM dbo.flyway_schema_History

WHERE success = 1);

end

else --Ah. This isn't a flyway-versioned database

begin --let's hope it has an extended property with it.

Declare @MaybeWeGotAVersion nvarchar(max)

SELECT @MaybeWeGotAVersion= convert(nvarchar(max),fn_listextendedproperty.value)

FROM sys.fn_listextendedproperty(

N'Database_Info', DEFAULT, DEFAULT, DEFAULT, DEFAULT, DEFAULT, DEFAULT

)

Select @Version=coalesce(JSON_VALUE ( @MaybeWeGotAVersion , N'lax $[0].Version' ),'1.1.3You add any prior migration files (previous versions), renamed to the flyway convention, ')

end --we default to

/* We now check to see if the path exists. If it doesn't, then we create it. */

DECLARE @command NVARCHAR(4000);

Select @command= 'if not exist "'+@DataFolderLocation+'\" mkdir "'+@DataFolderLocation+'"'

--Select @command

execute xp_cmdshell @command

Select @command= 'if not exist "'+@DataFolderLocation+'\V'+@version+'\" mkdir "'+@DataFolderLocation+'\V'+@version+'"'

--Select @command

execute xp_cmdshell @command

/* Dont write out the flyway_schema_history table. It is just too risky */

Select @command='

if ''?''<>''[dbo].[flyway_schema_history]''

begin

Print ''writing out '+@CurrentDatabase+'.?''

execute xp_cmdshell ''bcp '+@CurrentDatabase+'.? OUT '+@DataFolderLocation+'\V'+@version+'\'+@DataSource+'-?.bcp -T -N''

end'

--Select @command

EXEC sp_MSforeachtable @command;

-- we now prevent lesser mortals from using this feature now that we're finished.

EXEC sp_configure 'xp_cmdshell',0;

Load the data into the target database from the BCP data files

We have a parallel batch to read it into your currently empty copy of the database, at the same version. If this database is on another server, you’ll need to sync the data files over to it. You can also use a network attach but remember to alter the value in the data folder location (@DataFolderLocation) accordingly.

The script will select the data files in the folder matching the version of the target database, or lower. Before loading the data, it disables all constraints, then after the BULK INSERT, reenables them, checking there are no violations.

As this is executed by Flyway, I’ve added two placeholders:

${flyway:defaultSchema} – a default placeholder that specifies the schema in which the flyway history table is placed
${datasource} – a custom placeholder for the project name of the database, whose value is passed to Flyway on the command line, when we run the build.

If you use SSMS, you’ll need to remove these placeholders, but if you use SQLCMD mode of SQLCMD, you can change the placeholders slightly to provide values using the setvar command. for example: setvar dataSource "AdventureWorks2016 "

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

/* if your data is in a directory that uses the name of the datasource database,

use the macro '<dbname>' to match the data output routine.

if you change the name of the database to the source

of the data, remember to specify it without the macro and with the trailing

backslash as normal */

DECLARE @DataFolderLocation sysname = 'C:\BCPData\${datasource}\databaseContents';

DECLARE @CurrentDatabase sysname=db_name();--this database. eg PubsTest

Declare @DataSource sysname='${datasource}'; --'MyProject';

--the project name of the database e.g. Pubs

DECLARE @command NVARCHAR(4000);

DECLARE @CorrectVersion NVARCHAR(40);

DECLARE @DirListingOutput TABLE (Rawinput NVARCHAR(255)); --e.g. 1.1.6

/* read into a table variable the available subdirectories of data via

a CmdShell command */

--read in a list of all the diirectories

SELECT @command='dir '+@DataFolderLocation+'\v* /a:d /b';

INSERT INTO @DirListingOutput(Rawinput)

EXECUTE xp_cmdshell @command;

/* extract just the directory names with legal 'semantic Versions'*/

DECLARE @SemanticVersion TABLE

(

TheVersion NVARCHAR(30),

TheType CHAR(12),

SoFar NVARCHAR(30),

Major INT,

Minor INT,

Patch INT

);

Print 'getting Data from C:\BCPData\${datasource}\databaseContents'

/* Put into our version table all the available folders with their versions */

INSERT INTO @SemanticVersion (TheVersion, TheType)

SELECT Substring(Rawinput, PatIndex('%V%.%.%', Rawinput)+1, 30), 'folder'

FROM @DirListingOutput

WHERE Rawinput LIKE '%V%.%.%';

IF @@RowCount = 0

RAISERROR('Sorry, but %s doesn''nt have any data folders at ${datasource} of the right format (Vd.d.d)',

16,1,@DataFolderLocation);

/* first we read the flyway schema history to detect what version

the database needs to be at. */

if object_id('dbo.flyway_schema_History') is not null

INSERT INTO @SemanticVersion (TheVersion, TheType)

SELECT [version], 'ourVersion' --we need to find the greatest successful version.

FROM ${flyway:defaultSchema}.flyway_schema_History --

WHERE installed_rank =

( --get the PK of the highest successful version recorded

SELECT Max(Installed_Rank) FROM ${flyway:defaultSchema}.flyway_schema_History WHERE success = 1

);

else --Uhoh. looks like he's not using Flyway. Is he useing an Extended property?

begin

Declare @MaybeWeGotAVersion nvarchar(max)

SELECT @MaybeWeGotAVersion= convert(nvarchar(max),fn_listextendedproperty.value)

FROM sys.fn_listextendedproperty(

N'Database_Info', DEFAULT, DEFAULT, DEFAULT, DEFAULT, DEFAULT, DEFAULT

)

Select JSON_VALUE ( @MaybeWeGotAVersion , N'lax $[0].Version' )

INSERT INTO @SemanticVersion (TheVersion, TheType)

Select coalesce(JSON_VALUE ( @MaybeWeGotAVersion , '$[0].Version' ),'1.1.1') as TheVersion,

'ourVersion' as TheType

end

/* now we see what versions of the data are available in the directory

we parse it into the three integers in three stages. */

--get the major version

UPDATE @SemanticVersion

SET Major

= Substring(TheVersion + '.0.0.0', 1,

PatIndex('%.%', TheVersion + '.0.0.0') - 1 ),

SoFar = Stuff(

TheVersion + '.0.0.0', 1,

PatIndex('%.%', TheVersion + '.0.0.0'), '' );

--and the minor version

UPDATE @SemanticVersion

SET Minor = Substring(SoFar, 1, PatIndex('%.%', SoFar) - 1),

SoFar = Stuff(SoFar, 1, PatIndex('%.%', SoFar), '');

--and the 'patch' version

UPDATE @SemanticVersion SET Patch =

Substring(SoFar, 1, PatIndex('%.%', SoFar) - 1 );

DECLARE @VersionOrder TABLE

(

TheOrder INT IDENTITY(1, 1),

TheType CHAR(12) NOT NULL,

TheVersion NVARCHAR(30) NOT NULL,

Major INT NOT NULL,

Minor INT NOT NULL,

Patch INT NOT NULL

);

INSERT INTO @VersionOrder (TheVersion, TheType, Major, Minor, Patch)

SELECT TheVersion, TheType, Major, Minor, Patch

FROM @SemanticVersion

ORDER BY Major, Minor, Patch, TheType;

--now we get the version equal or lower than the previous data

SELECT @CorrectVersion=TheVersion

FROM @VersionOrder

WHERE TheOrder =

(

SELECT Max(TheOrder)

FROM @VersionOrder

WHERE TheType = 'folder'

AND TheOrder <

(SELECT TOP 1 TheOrder FROM @VersionOrder

WHERE TheType = 'ourVersion')

);

IF @@RowCount = 0

RAISERROR('Sorry, but there is no suitable version of the data at %s',

16,1,@DataFolderLocation);

--SELECT @CorrectVersion;

DISABLE TRIGGER ALL ON DATABASE;

--now disable all constraints

EXEC sp_MSforeachtable 'ALTER TABLE ? NOCHECK CONSTRAINT ALL';

SELECT @command='

Print ''inserting ?''

SET ANSI_NULLS, QUOTED_IDENTIFIER ON;

if ''?''<>''[dbo].[flyway_schema_history]''

BULK INSERT '+@CurrentDatabase+'.?

FROM '''+@DataFolderLocation+'\V'+@CorrectVersion+'\'+@DataSource+'-?.bcp''

WITH (

DATAFILETYPE = ''widenative'',KEEPIDENTITY

);';

EXEC sp_MSforeachtable @command;

EXEC sp_MSforeachtable 'ALTER TABLE ? with check CHECK CONSTRAINT ALL';

ENABLE TRIGGER ALL ON DATABASE;

The current version of all the files is held on GitHub here.

Test it out: Build AdventureWorks and fill it with data

We can now try it out on AdventureWorks. We’re going to build it and check that it is the same as current AdventureWorks. We’ll use AdventureWorks2016, though any version will do. Here is our data folder location, with an AdventureWorks2016 directory, which is on the SQL Server that hosts the database from which we’re writing out the data.

Write out the source data

Now, we run the first routine for writing out the data to the data folder location, and we end up with a databaseContents directory with the data files for all the versions we need. We only need the one version, which I’d assigned V1.1.3.

Within this folder are a whole lot of files:

Create the empty target database

This now provides the data for a new build of Adventureworks. For this build, we create a blank database of AdventureWorks, which I called AdWorks. This must be done before the build with a separate process, such as SQLCMD. I’ve added the logic to delete it if it already exists. AdWorks is hosted on the same server as the source AdventureWorks2016 database.

USE master

IF DB_ID (N'AdWorks') IS NOT NULL

ALTER DATABASE [AdWorks] SET SINGLE_USER WITH ROLLBACK IMMEDIATE

IF DB_ID (N'AdWorks') IS NOT NULL

DROP DATABASE AdWorks;

IF DB_ID (N'AdWorks') IS NULL CREATE DATABASE AdWorks;

ALTER DATABASE AdWorks SET COMPATIBILITY_LEVEL = 130

IF (1 = FullTextServiceProperty('IsFullTextInstalled'))

BEGIN

/****** Object: FullTextCatalog [AW2016FullTextCatalog] Script Date: 08/02/2021 10:09:54 ******/

EXEC AdWorks.dbo.sp_fulltext_database @action = 'enable'

END

ALTER DATABASE AdWorks SET ANSI_NULL_DEFAULT OFF

ALTER DATABASE AdWorks SET ANSI_NULLS ON

ALTER DATABASE AdWorks SET ANSI_PADDING ON

ALTER DATABASE AdWorks SET ANSI_WARNINGS ON

ALTER DATABASE AdWorks SET ARITHABORT ON

ALTER DATABASE AdWorks SET AUTO_CLOSE OFF

ALTER DATABASE AdWorks SET AUTO_SHRINK OFF

ALTER DATABASE AdWorks SET AUTO_UPDATE_STATISTICS ON

ALTER DATABASE AdWorks SET CURSOR_CLOSE_ON_COMMIT OFF

ALTER DATABASE AdWorks SET CURSOR_DEFAULT GLOBAL

ALTER DATABASE AdWorks SET CONCAT_NULL_YIELDS_NULL ON

ALTER DATABASE AdWorks SET NUMERIC_ROUNDABORT OFF

ALTER DATABASE AdWorks SET QUOTED_IDENTIFIER ON

ALTER DATABASE AdWorks SET RECURSIVE_TRIGGERS OFF

ALTER DATABASE AdWorks SET DISABLE_BROKER

ALTER DATABASE AdWorks SET AUTO_UPDATE_STATISTICS_ASYNC OFF

ALTER DATABASE AdWorks SET DATE_CORRELATION_OPTIMIZATION OFF

ALTER DATABASE AdWorks SET TRUSTWORTHY OFF

ALTER DATABASE AdWorks SET ALLOW_SNAPSHOT_ISOLATION OFF

ALTER DATABASE AdWorks SET PARAMETERIZATION SIMPLE

ALTER DATABASE AdWorks SET READ_COMMITTED_SNAPSHOT OFF

ALTER DATABASE AdWorks SET HONOR_BROKER_PRIORITY OFF

ALTER DATABASE AdWorks SET RECOVERY SIMPLE

ALTER DATABASE AdWorks SET MULTI_USER

ALTER DATABASE AdWorks SET PAGE_VERIFY CHECKSUM

ALTER DATABASE AdWorks SET DB_CHAINING OFF

ALTER DATABASE AdWorks SET FILESTREAM( NON_TRANSACTED_ACCESS = OFF )

ALTER DATABASE AdWorks SET TARGET_RECOVERY_TIME = 60 SECONDS

ALTER DATABASE AdWorks SET DELAYED_DURABILITY = DISABLED

ALTER DATABASE AdWorks SET QUERY_STORE = OFF

Build-and-fill the database using Flyway and PowerShell

Now that we have an empty database, with the awkward bits of the fulltext database in position, we can build the database, using Flyway, and fill it with data. This is the contents of the Scripts directory of the GitHub project:

The V1.1.3_AdventureWorks.sql migration script is an ordinary build script. I generated it using SQL Compare, but you could also do it in SSMS. I gave it the version 1.1.3 to match what we used earlier. Flyway must run it outside a transaction, and we must declare it as ‘mixed’. meaning that Flyway should allow mixing transactional and non-transactional statements within the same migration. Normally Flyway attempts to run each migration within its own transaction but if it detects that a specific statement cannot be run within a transaction, it won’t run that migration unless you allow transactional and non-transactional statements to be mixed within a migration run. The entire script will be run without a transaction. I’ve added a config file to specify these options. It has the same name as the build file but with a .conf filetype.

The SQL callback script, afterMigrate_ReadEveryTableViaBCP.sql, will run after the V1.1.3 migration completes and has the SQL code to import the data and check that it conforms to the constraints. I also use the afterMigrate script from my previous article, Customizing Database Deployments using Flyway Callbacks and Placeholders. It applies the new version to the database as an extended property and puts an entry in the server log that the migration took place. We need this to pick out the correct dataset for the version if we have more than one version.

With all this in place, we just need the PowerShell code to connect to the database and run the migration. If the migration succeeds, the callback will automatically run and bulk load the data. The PowerShell automation script is more elaborate because we use a technique that allows a user who has UserName/Password credentials to keep passwords in a secure place within the user folder. If a password is not already stored, you are asked for it once. All you need to do is to fill in the variables at the start of the script, appropriately. You don’t need to provide the $username if you use integrated security, but you need to specify in the connection string that you are using integrated security (integratedSecurity=true). The $ProjectName and $ProjectDescription are used for adding a SQL Server Extended property on the database, and for logging the migration in the SQL Server log. The $datasource refers to the project name within which the actual data files are stored.

Under the covers, we are then assembling the parameters that we pass to command-line Flyway, ‘splatting’ them as an array. Just so you can see what is happening under the hood. the actual Flyway parameters are:

–url=jdbc:sqlserver://MyServer:1433;databaseName=AdWorks
-locations=filesystem:MyPathToTheScripts
-user=MyUserName
–password=MyFiendishPassword
-placeholders.projectDescription=A sample project to show how to build a database and fill it with data
-placeholders.projectName=AdventureWorks
-placeholders.datasource=AdventureWorks2016

Here is the code:

$ProjectFolder = 'PathTo\BuildAndFill' # where the migration scripts are stored

$Server = 'MyServer' # the name of the server

$Database = 'AdWorks'; # The name of the database we are currently migrating

$ProjectName='AdventureWorks'; # this gets used to write out extended properties

$DataSource='AdventureWorks2016'; #where you got the data from

# to describe the database for monitoring systems in an extended property

$ProjectDescription='A sample project to show how to build a database and fill it with data'

<# you only need this username if there is no domain authentication #>

$username = 'MyUserName'

$port = '1433'

# add a bit of error-checking. Is the project directory there

if (-not (Test-Path "$ProjectFolder "))

{ Write-Error "Sorry, but I couldn't find a project directory at the $ProjectFolder location "}

# ...and is the script directory there?

if (-not (Test-Path "$ProjectFolder\Scripts "))

{ Write-Error "Sorry, but I couldn't find a scripts directory at the $ProjectFolder\Scripts location "}

# now we get the password if necessary

if ($username -ne '') #then it is using SQL Server Credentials

{

# we see if we've got these stored already

$SqlEncryptedPasswordFile = "$env:USERPROFILE\$($username)-$SourceServer.xml "

# test to see if we know about the password in a secure string stored in the user area

if (Test-Path -path $SqlEncryptedPasswordFile -PathType leaf)

{

#has already got this set for this login so fetch it

$SqlCredentials = Import-CliXml $SqlEncryptedPasswordFile

}

else #then we have to ask the user for it (once only)

{

# hasn't got this set for this login

$SqlCredentials = get-credential -Credential $UserName

# Save in the user area

$SqlCredentials | Export-CliXml -Path $SqlEncryptedPasswordFile

<# Export-Clixml only exports encrypted credentials on Windows.

otherwise it just offers some obfuscation but does not provide encryption. #>

}

$Uid = $SqlCredentials.UserName;

$Pwd = $SqlCredentials.GetNetworkCredential().password

$FlyWayArgs =

@( "-url=jdbc:sqlserver://$($Server):$port;databaseName=$Database ",

"-locations=filesystem:$ProjectFolder\Scripts ", <# the migration folder #>

"-user=$($SqlCredentials.UserName) ",

"-password=$($SqlCredentials.GetNetworkCredential().password) ")

}

else

{

$FlyWayArgs=

@( "-url=jdbc:sqlserver://$($Server):$port;databaseName=$Database;integratedSecurity=true ".

"-locations=filesystem:$ProjectFolder\Scripts ")<# the migration folder #>

}

$FlyWayArgs+= <# the project variables that we reference with placeholders #>

@( "-placeholders.projectDescription=$ProjectDescription ",

"-placeholders.projectName=$ProjectName ",

"-placeholders.datasource=$DataSource ") <# the project variables #>

Flyway migrate @FlyWayArgs -mixed= "true "

Check the build

So, if this works error-free, and I’m quietly confident that it will, you can then check that the data in the new build matches the original, using SQL Data Compare:

In this example, the source database was built using Flyway, but with slightly different details. We deliberately didn’t copy across the flyway_schema_history table, so these don’t match. Otherwise, the data is identical.

The same goes for SQL Compare with the script compared with Adworks, except that one or two constraints and a trigger had been disabled in the source but not the target, because we just enabled them all.

Adding migration scripts

AdWorks is now a Flyway-managed database. although you need to be aware that the data is being added by a method that Flyway isn’t controlling directly.

If we wanted to add migration scripts to what we’ve done, that will be fine. How you do it depends on whether you want the data reloaded at the end of every build. If you do, then empty the database before you start each run with the supplied routine, and make sure that you have a new dataset in your datastore each time you change any of the tables.

However, if you want subsequent migration scripts to manage any data movement, then simply remove or rename the AfterMigrate script that loaded the data so that Flyway doesn’t attempt to BCP data into tables that are already filled. When you have added the required chain of migrations, you can save the new layout of the data using the first SQL script and then replace the AfterMigrate script back into the directory, create a new database, run the whole migration set, and it will load the new data at the end. If you have a lot of data, you’ll find this to be the fastest way of doing it.

Conclusions

I’ve now given you all the information you need to use Flyway to do a conventional build. You can, of course, do the same with a migration, but if you can provide the data once you reach the right version of the database, why bother with a migration that preserves data? Flyway will obligingly take you straight to the version you want if you can provide it with a build script, and then it will quietly fill it with the data for you via the script in this project. I’m not suggesting that there is any ‘better’ approach to producing the database at the correct version. Flyway will fit in with the way that a development team has chosen to work and feels comfortable with.

Tools in this post

Redgate Flyway

Bring stability and speed to database deployments

Find out more

Redgate Flyway

Bring stability and speed to database deployments

Find out more

Product articles Redgate Flyway Database Testing and Quality Getting Data In and Out of SQL Server…

Guest post

Getting Data In and Out of SQL Server Flyway Builds

Guest post

The separation of data and metadata

Storing datasets

Maintaining datasets per database version

Multiple datasets at the same version

Writing data out and reading data in

Extract the data from the source database using BCP

Load the data into the target database from the BCP data files

Test it out: Build AdventureWorks and fill it with data

Write out the source data

Create the empty target database

Build-and-fill the database using Flyway and PowerShell

Check the build

Adding migration scripts

Conclusions

Tools in this post

Redgate Flyway

Redgate Flyway

You may also like

Level Up with Flyway: Database Development with Flyway - Module 1

Why standardizing migrations across multiple database types with Flyway adds up for Desjardins

A Simple Example of Flyway Development using GitHub Branching

Automating Migrations for Multiple Databases using Flyway

A Real-world Implementation of Database DevOps: People, Processes, Tools

Flyway's Repair Command Explained Simply

Product articles Redgate Flyway Database Testing and Quality
Getting Data In and Out of SQL Server…