I've been updating my family tree and in so doing often found web pages with useful information I wanted to copy but I got sick of all the copy/paste for person after person, especially when it was a list of several people! Even more aggravating was that most sites have a nice structured that "should" be easy for a program to extract the relevant data from.
What I needed was a way to pass raw text into a format that most Genealogy applications/sites could understand [GEDCOM files]. After lots & lots & lots of searching and testing of the import capabilities of lots & lots & lots of Family Tree programs and finding nothing remotely useful I decided I needed to write one myself!
Key Requirements:
- Text layouts can vary widely so the program had to be able to be easily adjusted to suit [Some sort of user configurable rules would be needed]
- Interactive/iterative approach needed to allow rule generation/testing
- Handle marriages and multiple generations of descendants
- Output would need to be in a GEDCOM format
- Running on WindowsXP (and hopefully above)
The result:
Program: FamilyTreeHelper.exe
Usage (Simple formatted text file [eg: Simple Text File]):
- Create a text file with with following layout...
- <@><Name>, <Sex>, <Birth date, <place>>, <Death date, <place>>, <Marriage date, <place>>
- Note: @ = + If child of the above [use multiple if the above is already a child]
- Note: @ = _ If spouse of the above [use after child if required]
- Paste text in the top box
- Can be edited further if required
- Use a rule button to pre-load the rules or create your own
- No need to edit if you followed the suggested format
- Run the Rules
- The results should be a list of names with some keywords/markers
- Process all
- Generates the INDI & FAM sections of a GEDCOM file
- Save generated file
- Creates a GEDCOM 5.5.1 compatible file
Usage (Web page text [eg: My Genealogy - Family Tree Maker]):
- Paste text in the top box
- HTML & Special characters will automatically be stripped!
- Use a rule button to pre-load the rules or create your own
- Use the [E]dit/delete or [A]dd buttons to modify any of the 4 different rule types
- Run the Rules
- People list will be generated with names and keywords/markers as follows...
- + = Child of the person above [supports nesting with ++, +++ for extra generations
- _ = Wife or Husband of the previous person [would ideally have the wedding info]
- |_S_ = Sex [followed by M or F]
- |_B_ = Birth detail [date, place (can have multiple commas in place]
- |_D_ = Death detail [date, place (can have multiple commas in place]
- |_M_ = Marriage detail [date, place (can have multiple commas in place]
- Click on a result and press "Test individual..." to see if the data parses properly
- Fills the individual data elements to the right
- Note: Data elements can be edited before processing!
- Press "Process the test"
- Appends the INDI and FAM [GED] sections for that user
- Repeat for next user and data appends so you can step through 1 by 1
- Press "Process All"
- You can skip this if you did the people 1 by 1
- Save generated file
- Creates a GEDCOM 5.5.1 compatible file
Licensing:
This is Version 1.0 so I'm sure their is lots of room for improvement, particularly as more complex text layouts are tried. As such, I'm not going to put a price on it as yet. If you test it and it looks like it does what you want eMail me if you want to activate it [Note: Donations improve chances].
This software is provided "AS IS". There is no guarantee or warranty on this software and I am NOT responsible if it eats data. Back up your data just in case. This software comes WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. You download and use it entirely at your own risk.