Alright, I'm going to try this ONE more time, and I want you to understand beforehand, I'm attempt to help you.
When you are testing something, you test, YES or NO.
You approach is erroneous. You are testing subdomain1, subdomain2, subdomain3, subdomain4, and so on through 20.
What happens if Something you did on Subdomain 13 takes down your WHOLE website, and therefore makes all your experiments null and void? How will you be able to tell what happened where?
Also, the reason I say different domains, and lower variables, is because you are attempting to find a magic bullet. Let's be blunt. You want to find a formula that will work more than 80% of the time. So putting it on 20 different domains and Going After 20 different niches, with 2 variations of the technique is going to get you closer to your objected goal. You are attempting to test 20 techniques on 1 domain, IN ONE NICHE. What works in one niche, may not work in another. What I propose you do, is attempt 2 or even 3 techniques, with 20 domains in different niches, so you can see what combination of things work.
Anyways, this whole thing is not going to work in the end, because there will be so many variables in an experiment with 20 variations or 2 -3 ,with 20 domains. You'll get a good understand of what does work, AT THE CURRENT MOMENT.
You are going to have to account for, what happens when one techniques pulls the main domain down or up, along the others. IF you believe, that each subdomain act and rankings completely independently, you are mistaken. You've got a shared IP, shared server, you've got a shared database, and if one subdomain's load is too heavy, it will effect the performance of the others, AND your overall efforts. So would you account for that, and track what's what?
This is why most scientists test 1 variables, and figure out if it's YES or NO. You are attempting 20.