I’ve lost count of the number of times I’ve had the need to generate a representative dataset of users. Of course I have access to many production datasets but for many reasons they can’t be used. Finding previous datasets I’ve randomly generated always seems to take longer than it should, so with my most recent iteration of having to generate a fictitious list of users with Australian addresses, I’ve documented how I went about it, along with the source data I used and the script to create it.
For my data sources to base my dataset off, I wanted representative data for Australia for both people names and locations. After a few quick searches I found;
The script is pretty simple. It imports each of the CSV’s listed above and generates a random number based on the number of records in each file.
The GitHub Repo contains the PowerShell script along with the source files. Change line 3 for the location where you store the CSV files and change line 66 for the number of users to generate. I’ve left the end of the script empty. I either insert the API call to create the users, or the PowerShell cmdlet with the data to do the creation depending on where I’m creating the users.
Here is a sample output in JSON format.
{ "Street": "370 Miskin St", "Surname": "Burne", "Suburb": "WOODBROOK", "Postcode": "3451", "State": "VIC", "GivenName": "Miro" } { "Street": "293 Preston Rd", "Surname": "Partingale", "Suburb": "MARRARA", "Postcode": "812", "State": "NT", "GivenName": "Daniella" } { "Street": "409 Orchard St", "Surname": "Liaseyer", "Suburb": "THURGOONA", "Postcode": "2640", "State": "NSW", "GivenName": "Ariana" } { "Street": "775 Station Rd", "Surname": "Nevin", "Suburb": "AVON DOWNS", "Postcode": "862", "State": "NT", "GivenName": "Naria" }
Using data publicly available and PowerShell it is possible to quickly generate a dataset of representative users and addresses. Generating other attributes is as easy as extrapolating from the existing data or supplementing it with additional source data files.
Today, I’m super excited to finally announce the Beta release of EntraPulse Lite – a…
I'm excited to share some significant authentication enhancements I've contributed to the Lokka MCP Server…
Last month I had the pleasure of speaking at the Sydney event for Global Azure.…
Model Context Protocol (MCP) is a powerful framework that extends AI clients like Claude and…
I've just completed participating in the Azure AI Developer Hackathon that was looking to provide…
Updated: July 2025 v1.0.2 Fixes issue setting D365SalesGlobals enabling session management for D365 Sales API…
This website uses cookies.
View Comments