I removed zero value outputs, excluded 20,412 provably unspendable outputs, and consolidated outputs to the same signer into a single claim. Here are some updated distributions (which if you are like me and like stats may find interesting beyond just this concept).
All stats are as of block
305,303 but I doubt it has changed significantly through the most recent block. I would expect the number of non-standard scripts, native-multisig, and PubKey outputs to decline as they are spent and replaced with an increasing number of P2PkH and P2SH outputs which means that snapshots should become more efficient over time.
Distribution Num Claims Claim Value
------------------------------------------------------------------------
Valid Claims 2,983,463 12,879,829.77705084 BTC
Invalid Claims 16,744 2,610.01397770 BTC
Total Claims 3,000,207 12,882,439.79102854 BTC
Distribution Num Claims Claim Value Pct of Valid Pct of Total
-----------------------------------------------------------------------------------------------------------------
0.01 to 0.10 bits 470,881 0.00725863 BTC 0.0000 % 0.0000 %
0.11 to 1.00 bits 238,560 0.06057174 BTC 0.0000 % 0.0000 %
1.01 to 5.43 bits 215,689 0.06049534 BTC 0.0000 % 0.0000 %
5.44 to 10.00 bits 35,806 0.09588866 BTC 0.0000 % 0.0000 %
10.01 to 54.30 bits 49,355 0.22008615 BTC 0.0000 % 0.0000 %
54.31 to 100.00 bits 234,955 12.46195347 BTC 0.0001 % 0.0001 %
100.01 to 1,000.00 bits 447,226 166.14026722 BTC 0.0013 % 0.0013 %
1,000.01 to 10,000.00 bits 432,314 1,535.80230367 BTC 0.0119 % 0.0119 %
10,000.01 to 100,000.00 bits 541,229 16,031.22170978 BTC 0.1245 % 0.1244 %
>0.1 BTC to 1 BTC 275,469 96,829.33301374 BTC 0.7518 % 0.7516 %
>1 BTC to 10 BTC 203,095 572,033.28805785 BTC 4.4413 % 4.4404 %
>10 BTC to 100 BTC 98,567 3,531,749.15046627 BTC 27.4208 % 27.4152 %
>100 BTC to 1,000 BTC 13,169 2,997,803.73298225 BTC 23.2752 % 23.2705 %
>1,000 BTC to 10,000 BTC 1,415 3,111,492.00522526 BTC 24.1579 % 24.1530 %
>10,0000 BTC to 100,000 BTC 97 2,242,337.41607392 BTC 17.4097 % 17.4062 %
>100,000 BTC 2 309,838.93715729 BTC 2.4056 % 2.4051 %
Minimum Claim Num Claims Claim Value Pct of Valid Pct of Total
-----------------------------------------------------------------------------------------------------------------
0.01 bits 2,983,463 12,879,829.77705084 BTC 100.0000 % 99.9797 % (all valid claims)
0.10 bits 2,512,582 12,879,829.76979221 BTC 100.0000 % 99.9797 %
1.00 bits 2,296,893 12,879,829.70929687 BTC 100.0000 % 99.9797 %
5.43 bits 2,263,537 12,879,829.61522062 BTC 100.0000 % 99.9797 % (new dust treshold)
10.00 bits 2,247,538 12,879,829.48921072 BTC 100.0000 % 99.9797 % (new minimum fee)
54.30 bits 2,154,634 12,879,826.75708845 BTC 100.0000 % 99.9797 % (old dust treshold)
100.00 bits 2,012,583 12,879,817.02725725 BTC 99.9999 % 99.9796 % (new minimum fee)
1,000.00 bits 1,565,357 12,879,650.88699003 BTC 99.9986 % 99.9784 % (1 mBTC)
10,000.00 bits 1,133,043 12,878,115.08468636 BTC 99.9867 % 99.9664 %
100,000.00 bits 591,814 12,862,083.86297658 BTC 99.8622 % 99.8420 %
1,000,000.00 bits 316,345 12,765,254.52996284 BTC 99.1104 % 99.0903 % (1 BTC)
10,000,000.00 bits 113,250 12,193,221.24190499 BTC 94.6691 % 94.6499 % (10 BTC)
100,000,000.00 bits 14,683 8,661,472.09143872 BTC 67.2483 % 67.2347 % (100 BTC)
1,000,000,000.00 bits 1,514 5,663,668.35845647 BTC 43.9732 % 43.9643 % (1,000 BTC)
10,000,000,000.00 bits 99 2,552,176.35323121 BTC 19.8153 % 19.8113 % (10,000 BTC)
100,000,000,000.00 bits 2 309,838.93715729 BTC 2.4056 % 2.4051 % (100,000 BTC)
1,000,000,000,000.00 bits 0 0.00000000 BTC 0.0000 % 0.0000 % (1,00,000 BTC)
A large portion of the "custom scripts" can be refactored to be a standard template or they are invalid. I now have only 20 scripts which are potentially valid and can't be reclassified. Another interesting thing is there are a large number of native-multisig outputs which have one or more invalid PubKeys. This allows them to be refactored. I parsed all multisig outputs and removed invalid pubkeys. If the number of valid keys is less than m in m-of-n then the output is unspendable. If the number of valid keys is reduced to 1 and the script is 1-of-n then it can be refactored to a Pay2PubKeyHash script. I haven't hashed PubKeys (to convert them to PubKeyHashes) yet. Doing so may allow further consolidation of the output set (for keys where there are both PubKeyHash and PubKey outputs for the same key).
The size of the snapshot will depend heavily on the min claim value used. If we assume the average record is 24 bytes (20 byte identifier and 4 byte for value) that would make a full blockchain snapshot ~72MB. It would be up to the spinoff developer but I believe to reduce size a min claim of 10, 100 or 1,000 bits is defendable. That would result in a snapshot of 54 MB (25% reduction), 48 MB (33% reduction), and 38 MB (47% reduction) respectively. Even with a minimum of 1,000 bits the included claims represents > 99.99% of the spendable BTC in the blockchain.