In September we wrote about what is a common problem in the genealogy community: not enough time to balance work, family, and our research. We took a radical step and decided to stop working on our family tree, so we could instead focus on wrapping up other projects and to get our research, citations, and document/photo collections in order.
It’s been 7 months since wrote that piece (Link)…here’s how the plan’s going:
Keep blog posts to 500 words (Grade: F) – Our last post was almost 2000 words, our average since September is about 1000, AND we have been lucky to post 2 times a month instead of publishing weekly like we intend. Lots of work to do here.
Clean, scan, and store the 2000 glass plate negatives from the Home Studio Collection (Grade: B+) – We decided not to clean them, we have the storage taken care of with all of the proper archival products. However, we got to about 70 plates scanned in October and stopped. It was too much time per plate. In response, in the last 3 months we purchased a lot of equipment to scan these more quickly and we were able to process 225 plates in about 2 hours last time we tested. We are about to finish the first crate of plates (~440 images) in 10 days! We’ll publish more details when we’re done.
While we are scoring a GPA just over a C, it’s mostly because of we failed at our primary approach of stopping work on our Family Tree.
Install a temperature/humidity control solution in our archive room (Grade: A) – It’s done! Our archives have spent the last 5-6 months at 62-65 degrees, and 42-45% humidity! There will be more work to do this summer, to cool/dehumidify the space, but we already have the controls in place.
Ensure each Source in our main, Public tree is properly cited and every Fact is supported by at least one Source (Grade: Inc) – To be fair, we started on this effort, beginning with 4xGGP Royal and Eliza (Jones) Morse (whom we’ve owed this documentation to the Morse Society for 2 years now!) 4 times, but the Family Tree Maker data corruptions kept setting this effort back (MacKiev’s Family Tree Maker is garbage). Once we got past that, we’ve made some progress, but we lost a lot of effort on this one.
Properly transcribe and index all family history interviews (Grade: C) – We’re half way there! But the last set of transcripts will be the hardest.
PUBLISH! (Grade: F) – Out blog posts are lagging, and we haven’t actually published anything outside of blogging. We’ve been reading some great family histories though, to get an idea of how others publish their stories, so we have good ideas once we’re ready!
Write our autobiographies, as well as begin to write out what we know about our family (Grade: F) – Yeah, so…we’ve written nothing. Looking back, this probably should have been left off the list…we were just setting ourselves up for failure.
Ensure that we’re printing out all electronic sources, so that our paper files are complete copies of our electronic files (Grade: A) – We’ve been pretty diligent on this one, and while we’ve had to redo a lot of electronic source citations, we did print them all out as we created them.
Spend a little more time with the family! (Grade: A) – We could have made a lot more progress on this list if sacrificed this one, but we’re spending more time together as family now than when we first wrote this.
While we are scoring a GPA just over a C, it’s mostly because of we failed at our primary approach of stopping work on our Family Tree. Instead, we shattered our largest brick wall (while adding nearly 40 new DNA matches) and made a huge dent in another brick wall. Much more on that will follow, but both efforts took a LOT of work, and those hours spent working on the trees directly slowed down the other work we hoped to accomplish.
That said, we’ve rededicated ourselves to getting these other tasks complete so we can finally turn our full attention to building our tree. Our properly cited and sourced family tree!
Before we get into the amazing connection we’ve made recently, here’s a quick update on the broader collection of 2500 glass plate negatives we were able to rescue that comprise the surviving images from the Home Portrait Studio. We talked about how saved these plates earlier, and about how we intend to scan and share all of these images publicly (Our project to save a piece of Racine, Wisconsin history, Part 1 – Getting Started), but in the months since our last update we found out it would be nearly impossible to use a flatbed scanner to capture those thousands of images without it taking years to complete. Fully processing 50 negatives was taking about 8 hours. We’ve since bought a bunch of equipment to instead photograph each negative digitally, and invested in software that allows us to now fully process those same 50 images in just under an hour. We expect that with this new process we’ll the entire first box up and shared next month (440 images), and we’ll provide much more detail as soon as they complete.
In the meantime, through an amazing set of coincidences we determined the name of the man in the WWI uniform we posted originally, met his grandson, and learned more about Frank Stritesky who owned the Home Portrait Studio.
The photo is William H Rastall (born in 17 Jun in Racine, Wisconsin) and who served in the US Army from December 1917 to January 1919. William was the brother of Frank’s wife Anne, and lived in the same neighborhood.
Just days after we first shared these images on the “Racine History” Facebook group, and on this blog, William’s grandson Michael began searching the Internet to buy an original or reproduction of the patch on William’s uniform. The photo of William with the 119th patch was a family treasure that had been passed down.
Michael’s first search turned up our scan, posted to a military insignia message board that we’d used to try and narrow down the timeframe that image was taken. Michael has prints of this family heirloom, and figured that the negatives were in the home that housed the studio which was owned by Frank’s daughter Betty, so he was floored to see a full-frame version of the picture randomly by a stranger on the Internet!
He reached out the next day, and after exchanging texts and phone calls, it turned out we were close enough for lunch, and got to meet and exchange stories. Michael was able to share many of the family pictures that were taken by Frank over the years.
The image we first published of William was almost certainly taken in 1918, at a time when Frank was just getting Home Portrait Studio up and running, and during that time he’d often take photos of family to practice/build his portfolio. William married after returning home from his WWI service, and he and his wife Alice had one son, Patric who had a life-long interest in photography owning back to his Uncle Frank. Frank had let Patric work the cameras from time-to-time when he was younger, and Patric’s interest eventually rubbed off on his son Michael, who works as a professional photographer in the Chicago Suburbs (Michael Rastall Photography).
Patric and his wife had visited Betty decades ago hoping to salvage the remaining negatives and maybe some of the studio gear, but Betty wasn’t interested in letting anything go. Given the condition they were being stored in there was little hope they survived after Betty’s passing, so it came as a complete shock to the family that they were seeing the light of day. With any luck, as we continue through the 6 boxes we saved they we’ll find more of those family treasures.
We’re also hopeful we can eventually reunite many more families with these amazing images of their ancestors, but for now we’ve at least made that first link!
In this installment we’re going to walk through a key tool to help narrow down where to research when you have AncestryDNA tests that match your family, but despite your research you’re not sure where they match. DNA Painter has a great tool called What are The Odds that gives us the probability of where these unmatched lines link up with our own.
We’re using a real set of unknown matches for this example. Emma Kupps (1879-1953) is a one of our favorite ancestors. She was born and raised the various logging communities that sprang up in North Central Wisconsin in the late 1800’s, but her family settled in Antigo where she graduated from Antigo High School. Within a few years she would married a logger Daniel Leonard (1868-1924), who would soon become Antigo’s Fire Chief, and years later be elected Sheriff of Langlade Coounty, Wisconsin. During his term Dan became ill with cancer, and succumbed with a significant portion of this term remaining. The governor of Wisconsin appointed Emma to the position of Sheriff to complete her late husband’s term, and she became the first woman in Wisconsin to hold the office. (Langlade Co. Historical Society)
DNA Painter has greatly narrowed down where we’re targeting our on-going research to finally break down this brick wall.
But, to family historians, she’s also near the end of a line that is a classic brick wall. Her father died young, and there’s nothing but a couple of records that indicate only the names of his parents. Plus, they are the only lines in our family that come from Bohemia, so it has the combined brick walls of classic genealogy and DNA results.
We’ve identified a group of AncestryDNA matches that have strong Bohemian roots and match descendants of Emma. We used Michael’s Great Uncle as our target DNA match, since he’s the oldest generation tested on that line, and we built a master tree that links as many of the unknown DNA matches as we could. We ended up with 8 AncestryDNA matches that we could link together in a cluster.
The cluster all share Jacob Haasl and Dorothy (Johannek) Haasl as their MCRA, but we haven’t been able to build a link between Great Uncle Leonard and the Haasl’s. So, we’re going to turn to DNApainter’s “What are the Odds” tool, to help identify where we’re most likely linked to the cluster.
When you open “What Are the Odds?”, it will present a box for the most recent common ancestors (MCRA). The options are to “Edit Names”, “Add Child”, or “Add Parent”. In this case, we’re going to edit the name, and add the cluster’s MCRA, Joseph and Dorothea Haasl. When we enter that information, we’re presented with the same 3 choices, but this time we’re going to start building a line to one of the DNA matches but selecting “Add Child” and entering the name of the child that makes up the first step to our DNA match. At first we were surprised how quickly we built out a tree, but it’s because we’re not entering all the data we’d need for a regular tree, just the names!
When we reached a DNA match we entered the cM value that matches our known DNA test. We repeated this step for as many matches as we’ve identified. This works well with a single match, but better with more. In our case we identified 8 matches, so we’re built them all out. Now we’ll really see the power of this tool.
Now that we’ve entered what’s known, it’s time to start mapping out our guesses. In fact, the entire purpose of this tool is to compare the likelihood of at least 2 hypotheses matching the entered cM, and from those likelihoods we can focus on where it’s most likely we all share a most MCRA.
The most likely connection for Jacob and Mary Keips’ line is her parents. We don’t know her maiden name, or birth date, but if we guess that she was born in 1820-1825 it’s reasonable to guess she is a sibling of either Jacob Haasl or Dorothy (Johannek) Haasl, so let’s build that out as option 1. We’ll add an “Unknown 3xGGP” to Joseph and Dorothy, and add a child called Jacob/Mary (because it could be either!). From there we’ll build down to the Great Uncle that is the known DNA match, and select “Use as Hypothesis”.
It shows us a probability of “1” because DNA painter doesn’t show you raw percentages, it shows you comparative probability of one match vs. another. For example, if you enter two hypotheses and one returns “1” and the other returns “2”, we’ll know the second one is twice as likely as the first. In this case, we have no other hypotheses entered, so it shows just a 1.
Given the cM match, it’s most likely that we match the cluster with Great Uncle Leonard’s 3x to 5x GGP’s, so we built out the same line as above, but this time with one more unknown ancestor above Jacob/Dorothy Keips, which would then make Uncle’s MCRA a 4xGGP.
When we built that out, and selected the second “Great Uncle Leonard” as a hypothetical, it soared to a whopping score of “1174” vs. the first “Great Uncle Leonard!! Given that we have 1174 for one possible link and 1 for the other, DNA painter just told us that while not impossible, we’re looking for a 4xGGP as our MCRA, not 3xGGP. Not great news, since now we have to go at least two more generations back, and to build this match back further we’re going to have to dig deep into 18th Century European genealogical records. That’s not our strong suit. But, at least now we have a clear picture of where we’re looking to link these groups.
Since the range of likely Great Grandparents is 3x-5x, we then built this hypothetical out to our match’s 5xGGP, and we see the same score of 1174 from a hypothetical 5xGGP. That means it’s equally likely that our link to this cluster of match is through our Great Uncle Leonard’s 4xGGP or this 5xGGP, but it’s almost certainly NOT through his 3xGGP.
While in some ways this is disappointing, and we’d hoped to come through with a match, this is actually a huge piece of this brickwall puzzle. When we started the work on this DNA cluster we knew that John Keips/Kupps had migrated from Bohemia and, at the time of his death, his wife thought his father was Jacob D. Kupps when she filled out her husband’s death certificate. From their marriage certificate we knew John’s mother, and Jacob’s wife, was Mary. We also knew we had a large cluster of DNA matches who came from the area of Bohemia.
Just by going through that cluster, building out a central tree that links them all, we found a great lead that likely shows John’s arrival information, along with approximate birth years for Jacob, Mary, and John…as well as John’s previously unknown siblings who seem to have a long history together in the US, and left many records. That means instead of having exhausted all the on-site research we could do on the John’s line, we now have a large number of leads to follow and see if we can push back another generation from both Jacob and Mary. We now know enough to start targeting death certificates for both, which may contain critical names, as well as 6 more marriage/death certificates to look for Mary’s maiden name, as well pieces of evidence that link our Jacob to the arrival Jacob. And, DNA Painter has greatly narrowed down where we’re targeting our on-going research to finally break down this brick wall.
We also have about 20 trees integrated into the master tree, and all of their owners are likely working towards the same goal as we are. As they do their research, and new DNA matches are added to the mix over the years, it’s likely one of us is going to have that piece of the puzzle we’re missing, and finally put it all together.
(A quick note, we’re using the newer version of GEDmatch called “Genesis” for this walkthrough. It’s soon to be the only GEDmatch, but if you’re not familiar with it, please use GEDmatch Genesis for this example.)
Back in our 1st installment of this series (Link), we suggested that you upload your AncestryDNA results to GEDmatch. We’re hoping you’ve done that, and we’re going to walk you through why this site is so powerful.
GEDmatch is a private site that is run out of Florida, for the purpose of allowing genealogists to upload their tests from all commercial DNA providers, with a complete set of tools to help us make matches between different tests. The key to GEDmatch is both its great tools, as well as it being open and public with all of the tests that are uploaded, but that’s also the warning that goes with GEDmatch: your tests are searchable by anyone. Your raw data is never shared, but your match information is openly shared to anyone that joins, including law enforcement. There have been a lot of stories shared about how GEDmatch was used to solve nearly 30 cold cases (so far), and reunited 10,000 adopted children with their birth families. There is a real chance your uploading of your DNA data is going to unearth secrets that your family doesn’t want unearthed.
At first GEDmatch was bewildering and confusing to us…and most of the walk-throughs we saw online were so detailed, it only further confused us.
We’ll be honest, at first GEDmatch was bewildering and confusing to us…and most of the walk-throughs we saw online were so detailed, it only further confused us. So, we’re going to focus more on the functionality GEDmatch than the technical details. This is still rather advanced, but when you practice with your matches, and then search for the more the technical details, you’ll find it makes a lot more sense.
Each test uploaded to GEDmatch is assigned a kit number, and most of the tools will require you to enter either your or your target match’s kit number. Especially when you have multiple kits you’re working with, it will be helpful to have your main page with all of your kit #’s open while you work in other tabs. It’s easier to cut and paste.
One-to Many Comparison
On your home screen you will see all of your kits, and if you click a kit # you’ll be taken to the first tool: One-to-Many Comparison. When the first box comes up, to start with we suggest you filter on only those matches that are 20cM and up. A whole list of names and numbers will pop up, and this is the list of everyone who’s uploaded a kit to GEDmatch that matches your kit. Let’s walk through what each of the boxes mean:
Select – You can pick at least 3 kits here and shortcut several of the tools by clicking on the “Visualization Options” button on the top header.
Kit #, Name (Alias), Email – This the information for you match, including a REAL email address! No more sending an Ancestry message, never hearing anything, and wondering if they ever log into Ancestry. Some people (and we recommend this for everyone managing someone else’s kit) use an alias for their testers name.
GED/Wiki Tree – You can enter a GEDcom-format family tree, or a link to your WikiTree, to help your matches find that MCRA. We’ll walk through this later.
Age – How long has the kit been on GEDmatch
Type – The chip version your test was from. Ignore this, it won’t matter for what we’re doing
Sex – If the GEDmatch uploader didn’t select a Sex for their tester, this will be U
Haplogroup – Shows which group you’re line is in if you took a mtDNA (mothers) or YDNA (fathers) DNA test. Ancestry doesn’t offer these tests, and if there are values in this for your matches, ignore them for now.
Autosomal – This is the type of DNA test Ancestry offers, and it’s what you think of when you think of DNA tests (ie: 50/50 inheritance from mom and dad). This is the area we’ll most care about, and it lists the total cM’s you share with the match and the largest piece of match in your samples. The “Gen” value gives you an estimate of how many generations separate you and this match.
X-DNA – We won’t cover this much, although it will get at least a mention in later post. X-DNA won’t be of much value, except in one very specific way.
You can click the header of each column to sort the results, but in this case let’s take a look at one of the higher cM matches on my list that I don’t yet know who they are.
One-to-One Autosomal DNA Comparison
For our example we’re going to select kit #M717701, especially since they have a family tree shared for this match. Clicking on the “GED Wiki” link in the GED column opens a link to their public tree on wikitree.com, but reviewing all of the surnames there, I don’t see any that jump out at me. I’m going to copy the kit # and go back to the home screen, to check out our next tool: One-to-One Autosomal DNA Comparison
Here’s where you actually confirm your matches to other kits, and start to get a view into the actual DNA segments that all of these matches are built off of. In the Ancestry tool, you take their word for it that you match someone, but this tool will actually prove you’re related.
Entering our kit # first box, and pasting in M717701 in the second, we get a picture of exactly where we match, or if we actually share no significant DNA (it happens!). In this case, we match the sample on Chromosome 14, and the exact start/stop positions of the match are listed. (We’ll come back to that in our next post, about DNApainter).
If you scroll all the way down to the bottom there’s more detail about this match, the most important info showing the estimated number of hops to our Most Common Recent Ancestor (MCRA). In this example it shows as 4.4 between us and the GGP that we share, but it’s best to double that number and use it to estimate the number of people between you and your match. 4.4×2 is 8.8, so we’ll round up to 9, and when it comes time to build out the link between us we’ll expect that it’s likely 4 or 5 ancestors up to the MCRA, and 5 or 4 ancestors back down to the match. Without even going to the “Shared cM Project” graph, we can guess that it’s most likely the MCRA we’re looking for will be a 3xGGP (4 ancestors from me) or 4xGGP (5 ancestors from me). But, we need to narrow down which side of the family this match is on, which brings us to our next tool: People who match both, or 1 of 2 kits.
People who match both, or 1 of 2 kits
This tool is very powerful, and where we do most of our work, and make most our matches. This is especially true if you have more than one kit in GEDmatch, because you can quickly narrow down target matches to either being on your mother’s or father’s side of the family. For example, I have my mother’s and a paternal uncle’s kits uploaded, so if I use GEDmatch to show me all of the kits match my test and my target test, I can pretty quickly see if it’s on my mother’s side, or my father’s. Plus, I have many known matches that we don’t manage, so we can usually get pretty lucky on narrowing it down further.
For our example, we ran this test on our target user from above and we see that they match my mom’s test also, so we’re looking at a MCRA on her side. We just cut our search in half! Looking more closely, her cousins on her father’s side have maybe a dozen tests in GEDmatch, but none of them are listed here. It’s very likely then, given the volume and closeness of the paternal cousins’ tests that if this was on my mom’s father’s side, they would show up in this list…so we’re going to assume this match is on my mom’s mother’s side. We just took the total number of potential MCRA’s down in half again, so we’re really narrowing in! Given that we already have 21 of 32 5xGGP’s identified in that branch of the tree, odds are pretty good we’re going to find a common surname.
In this case, we’re going to have to do what we did in our last post (Link), which corresponds to DNA secret #2: we’ll spend most of our time building out other people’s trees! Looking at the GED Wiki tree they have up, there’s a decent 4 generation tree on the match’s father’s side, but only a mother’s name and birth/death. Since nothing on the father’s side jumps out, we’ll build out their mother’s side first.
The other huge advantage at this point with GEDmatch is that we have the direct email address of our relative immediately, and so, once we’ve validated we’re a match with the 1:1 tool, we can reach out to what’s likely their regular email account and see if they have more information.
Triangulation is confirming that not only does your test match the target’s kit, but that another known ancestor’s kit matches as well. In the above example we confirmed I matched with the “One-to-One” tool, and we linked my mom with the “People who match both” tool, but technically we’re just guessing my mom’s kit also matches the target kit. I’ve never had it be wrong, but to really prove the match it would be best to confirm the matching segment for myself, my mom, and the target all match. There’s a GEDmatch “Tier1” tool for Triangulation (and we find it VERY worth it to donate $10/month to get the Tier1 tools!), but it’s just about as easy to do a One-to-One test between my mom and the target to do a full triangulation.
Once we prove out the family tree link to our “Cake” ancestor, and we have our MCRA, we will have really narrowed down how we’re linked to everyone else on the “People who match both” list. There’s nearly 50 matches on that list that we now have very solid evidence on who the MCRA is. Even if we have to build out most of those trees manually, it’s likely we already have pieces of those trees built out when we made the first match.
That being said, it’s one of the reasons it’s so frustrating that Ancestry has such weak tools. If we could combine Ancestry’s strong base of Public Trees with GEDmatch’s tool set, our effectiveness when making DNA matches would be extraordinary! It’s also why it’s so important to upload/link a GEDCOM file to your GEDmatch tree. It doesn’t have a great interface, but even ugly tree’s make this search much, much easier!
Using just these 3 simple tools you can build out many matches using GEDmatch. Just keep practicing, and quickly it’s like you’ve been using it for years. And, as your skill in the tool grows, those overly technical online walk-throughs will help further unlock the power of AncestryDNA tests!
In our next installment, we’re going to use DNApainter to narrow down those matches were we have no data on how we’re connected to a verified match, so we know where to focus our traditional genealogical research, and we’ll talk about some of the more advanced topics we won’t dive into too deeply in this series.
Once you have all of your hints with notes attached, look for your largest unidentified match that has a Public Tree, even if it’s unlinked. Since the tree has no hints you’ll know their tree doesn’t match yours (yet!), but the larger the amount of cM the closer your match…so the fewer relatives you’ll have to build out.
Before we dive in with our first example, here’s the first of several surprising truths about doing genealogical DNA work: you will spend most of your time doing other people’s trees. In a perfect world your DNA match built out their tree to their 4xGGP too…but you’ll find most times you won’t be so lucky!
DNA match surprise #1: You will spend most of your time making DNA matches building out other people’s trees.
Example #1: Eileen Wilson
As an example, let’s look at a match with an Unlinked Tree and 242 cM of shared DNA. Looking at the match’s Public Tree, the names don’t jump out (other than the very common “Smith”), but based on our notes, we know it’s on Michael’s Father’s Mother’s Mother’s Father’s line…which was William Arthur Smith. To confirm, we entered the cM in the DNA Painter Shared cM Project tool (Link), and the results (eliminating the ½ siblings) indicate the most likely matches are from our tester’s Grandparents or Great Grandparents.
Expanding our tree, we see that we had already identified Wallace David Smith in our tree, his wife Mabel, as the brother of William Arthur Smith. We also had their daughter Lula, which all of which sync’s up with the match’s tree. From there, it’s pretty easy to prove out that the DNA match is the daughter of George and Lula (Smith) Hopkins.
This is the same process whether the DNA match is 242 cM or 12 cM: use common matches to narrow which line the DNA likely matches you, identify the most likely target for your match through tools like the Shared cM Project charts and the “What are the odds?” tool. From there, build out the likely tree based on your estimates until you find the match. Then, you update the notes so when you find another shared match, you’ll have the info to narrow down their DNA line!
Make sure to make a note in your match, as we did in Part 2 of this series, so you’ll be able to focus in on other matches with less shared DNA later.
Let’s try one more match that’s a little further out.
Example #2: “A.G.”
The next target is going to “A.G.”, a woman that shares 26cM with the same tester as the first example. The first thing we do is review A.G.’s shared matches with us, and we see notes indicating that the match is on our tester’s Father’s side, so we just narrowed our focus to that ½ of the tree. Next, we went to DNA Painter Shared cM Project tool and maped out the most likely matches for the level of shared cM, which shows that it’s likely our shared match is around a 4th Cousin, Once removed or a 5th Cousin. This means, it’s most likely we’re looking for 3x/4x GGP as our MRCA.
The good news is that we have a strong tree to 3x/4x GGP’s for our tester. The bad news is, we have no tree for A.G. and we’re going to have to build hers out to understand where we match. We followed our own instructions on building out a “quick and dirty” Ancestry tree (Building a good Public Ancestry.com tree – Part One: sources, citations, facts, and proof), especially paying attention to find proof of relationships between each generation. And, in the end, after about 16 hours of total work we found…nothing. No match.
Which brings us to our second of our surprising truths about doing genealogical DNA work: Your DNA matches will mostly be on family lines you already have great information on, and conversely most of your unlinked DNA matches will be on family lines which are already your brickwalls.
DNA match surprise #2: Your DNA matches will mostly be on family lines you already have great information on, and conversely most of your unlinked DNA matches will be on family lines which are already your brickwalls.
In this case, we have limitations on several key areas of our family tree. Our tester is Michael’s grandmother, and on her paternal line we run into a pretty solid brickwall at her 2x GGP. They likely were born in either New Hampshire or Vermont, before they migrated to Michigan through New York, but 3 generations of family historians haven’t gotten past Alvin Jewell (1830-1911). In A.G.’s line, there are two couples from Vermont, from about that time, but there’s not enough evidence to pursue a solid line of inquiry.
This brings us to of our third surprising truths about doing genealogical DNA work: Even with the best of trees, and hours of effort, you’re going to have a lot of matches that you’re not going to be able to link to your tree.
DNA match surprise #3: Even with the best of trees, and hours of effort, you’re going to have a lot of matches that you’re not going to be able to link to your tree.
This is also where the limitations of AncestryDNA start to become apparent. There are nearly no tools there to help us determine which side of our match’s line do we expect we match. How can we leverage DNA triangulation to further narrow down where we should be researching? When you’re trying to figure out where to look amongst 32 GGP’s who might be a key to your DNA match, being able to eliminate ½ of those potential matches is a huge boon. But, beyond what we’ve already gone through, there’s not much more they can offer.
One of the other limitations of AncestryDNA is that you can never prove your matches. Even in our first example, we have a good tree match, and the amount of shared DNA (242 cM) matches exactly where we’d expect the two samples to match (2nd Cousins), but without tools like a chromosome browser, it’s impossible to prove those two kits match as we’ve assumed.
We’ll be looking at other tools in later installments, including how we can narrow down the search for our MCRA link to A.G.
In our next installment we’re going to go through a GREAT set of tools in GEDmatch that will demonstrate what we wish we had in Ancestry, and we’ll show you how to leverage you DNA results there to really unlock your matches.
Last week we discussed the (very!) basics of DNA testing, and we’re going to take that jumping off point and walk you through how we identify our AncestryDNA matches.
So, congratulations you got your test results back! Now what?
Check out our Ethnicity, and then move on
We wrote extensively why Ethnicity is not a valid part of genealogy and it often does more have than good (It’s time to stop giving attention to “Ethnicity” and genetic admixture), but everyone wants to look at it first (including us!), so give it a read through…and then be done with it. You can go back when you’re bored, but for now let’s get to some real work!
Export your results to GEDmatch
AncestryDNA has the largest DNA database, and the largest set of trees to help establish DNA matches, but their toolset isn’t even basic. Their tools are essentially non-existent. GEDmatch is a free site that provides a great tool set, and results there are used for some of the most important tools you’ll use as you progress deeper in mining your DNA matches. Plus, tests from all of the major sites can be compared on GEDmatch, so you will find 23andMe and Family Tree DNA kits matched to your AncestryDNA there. You’ll also get direct email addresses to your match!
Assuming you want to move forward, we suggest that you start the transfer process first thing since it will take a few days for GEDmatch to full analyze your Raw DNA data.
We’ll come back to GEDmatch in a future post in this series, as we dive deeper into some of the great tools available there.
Evaluating your matches
Now, let’s get to the first matches! That first look at what’s likely to be 2000-3000 DNA matches is overwhelming, but we’re going to break everything into smaller and smaller bits until we can really start to leverage these matches.
If you followed our Part 1 advice of building our your tree to your 4xGGP, with any luck you’ll now have matches with “Hints” (indicated by the little shaky leaf next to the “Match” button). In the “Filter’s” section, click on Hints, and it should show you only your matches with the shaky leaf. When you click “View Match” button, the path of connection between you and your match should come up.
In the example to the left, our Most Recent Common Ancestor (MRCA) is our Great Grandparents, and Ancestry has mapped out each of the steps between us. Given AncestryDNA’s limited tool set, we only have a few ways to successfully build out our matches. The first of these was Hints, the next one we’ll use is “Notes”
How to use “Notes” to quickly identify your matches
There are literally a million ways of using the Notes fields in AncestryDNA, and we don’t claim this to be the best way…or even a good way. It’s just the way we’ve found most helpful.
For each MRCA we assign a Line #, and in this example Charles and Rhoda Smith are Line 11. This is the note we’ll use:
Rick-MMF (Not Researched, Charles Henry Smith and Rhoda Upper) – Line 11
Which breaks down as:
Rick-MMF – We do our genealogy from the standpoint of our son, Michael, and so this helps quickly identify that the match on Michael’s Father’s Mother’s Mother’s Father’s line.
Not Researched – Ancestry, or other quick research like a “Notes” entry has said this is a match, but we haven’t gone through yet and confirmed the genealogy nor have we added this match to our tree yet.
Charles Henry Smith and Rhoda Upper – The Most Recent Common Ancestor (MCRA) between us and our match.
Line 11 – The line # for these MCRA’s.
These notes help us quickly identify the MCRA for unmatched DNA matches, they quickly identify the matches we need to work to make official, and they give us line numbers we can use later to search for all matches with the same MCRA.
An example of how we use these Notes
After we’re done identifying and validating all of the DNA matches with Hints, and we start trying to identify how other matches tie into us, and what we can learn from those ties, we’re going to start with some very limited ways to identify the MCRA and/or the path to that MCRA. In this example we have a DNA match that has an unlinked Public tree, but with a strong 48 cM of shared DNA.
When we click through to the test page, and select the unlinked tree, we find a dead-end: everyone is Private.
In GEDmatch we’d have a series of tools we could use to narrow down this match, but in AncestryDNA we can’t tell if this match is even Paternal vs. Maternal. But, if we go into the “Shared Matched” and look for any kits listed there that have a Note, we can click the note and see who they match. In this case, they match our Lila Miller match, which has the note we made before attached, so we know this match also is likely to be on Rick’s MMF’s line, and that they are also on Line 11. We will make the same note in this match, and return to our searching.
These notes will be invaluable as we get into the next installment of “Making the Most of your AncestryDNA Matches”, and use Ancestry as deeply as we can to build out/prove our family trees.
Until next week, update all of your Hints with the proper Notes, and we’ll start blazing new trails!
As we approach Christmas 2018, and given the massive push to have cheap DNA tests given out as gifts this season, it seems natural to finally write a series on how to make genealogical use of a DNA test you, or your loved one, may have just taken.
We’re going to start with the very basics on how DNA testing works, and walk through both how to leverage AncestryDNA to track down ancestors as well as using GEDmatch and other advanced tools to go even deeper.
Assuming you have a few weeks before the test results are in, here are a couple of things to learn and prepare before you dive into the matches.
First, understand that while the commercials like to highlight the joys of learning your ethnicity, DNA testing raises serious issues that will likely come up as your journey progresses. You may uncover family relationships, both inside and outside of your family, that could have serious negative impacts on people. We’ve uncovered children born outside of marriages that were never known to the family, and we know of adopted children who were outed by tests where their parents had never told them. We wrote about an example of this last year (Dangers of DNA Testing).
Second, they key to effectively making matches will be a good, solid family tree through the test subject’s 4x Great Grandparents. Most of your matches made will be through 3x or 4x GGP, and in a perfect world the match will also have a good tree so the link will be obvious. We can’t over state this, or stress it enough: your success/failure of matching DNA tests from unknown relatives will rely on the quality and depth of your tree. We’ve walked through how to build a good “quick and dirty” Public tree on Ancestry (Building a good Public Ancestry.com tree – Part One: sources, citations, facts, and proof), and the process would be about the same on other sites, many of which are free.
It’s also important you have the tree available publicly…many of your interactions are going to be about exchanging trees to build a match. It’s ok if you have just a skeleton tree with basic information(names, date of birth/death, locations, children, etc.), but it will be key that you have something available publicly.
The main new term/concept you’ll need for effective Genealogical DNA research is a measure of distance: centimorgan (cM). Now, it’s not technically distance…but for all intents and purposes, it’s used as a measure of distance.
What does cM measure?
Centimorgan measures length of DNA strands. More specifically, it will be used to measure the length of matching DNA segments between your test and a test that is a genetic match. For example, you have roughly 6800 cM if you take all 22 chromosomes and strung them out end-to-end, and your matches will have varying levels of matching DNA, measured by centimorgans.
How do we use centimorgans to identify matches?
Since you get about 50% of your DNA from each parent, your DNA tests will match a test from your parents with about 3400 cM. You will match a Grandparent with about 1700 cM (50% of your parent’s 50%). The more cM you match someone, the closer a relative they are, and the more likely that you will confirm a match with them.
We’ll use both charts from ISOSG (The Shared cM Project table) and an interactive version of that chart from the DNA Painter site (Shared cM Interactive Tool), which both break down the average cM to expect with various relatives, and helps us identify where to look to establish a match. For example, if a match is 311cM then we can guess they match the person with the DNA test at around a 1st or 2nd cousin…which means our common ancestor is likely a Grandparent or Great Grandparent, which narrows down our search!
So, there’s the first part of this DNA journey. There’s a little homework while you wait for the test results, a basic understanding about how we’ll actually leverage the DNA to make matches, and why your basicGenealogy and a solid family tree will be key to this process. Next week, we’ll go over what to do when you first get your DNA results!