I need to generate (somewhat realistic and connected) synthetic data using the source schema and noticed White Rabbit has the ability to generate fake data from your source scan.
My concern is that this is not truly “fake” and it would be easy for PII to leak in. How valid is this concern and is there a recommended approach for a task like this using White Rabbit? Should it not be done at all? I’m wondering if I should do this purely manually or if White Rabbit can help and I just need to comb through it to make sure it is truly fake.