Thanks for the old story archive in the wiki, I collected some interesting stats
I've been trying to compile a dataset of ballbusting stories to play around with machine learning stuff and wanted to take a moment to thank whoever decided to start archiving the old posts in the wiki. The reddit API makes finding old stuff rather painful, but thanks to the wiki archive I was able to acquire nearly twice as much data.
I think I should at this point have a copy of every single post on the sub in my dataset, aside from the ones that are links to external sites. I decided to calculate 2 statistics out of personal curiosity. The number of posts each user has created and the number of posts that have a particular score.
In this list the number on the left is a score, and the number on the right is the number of posts that have a score at that level or higher.
>0: 3749,
1: 3716,
2: 3700,
3: 3684,
4: 3660,
5: 3633,
6: 3595,
7: 3563,
8: 3516,
9: 3476,
10: 3425,
11: 3389,
12: 3342,
13: 3297,
14: 3236,
15: 3176,
16: 3117,
17: 3056,
18: 3001,
19: 2928,
20: 2852,
21: 2771,
22: 2697,
23: 2624,
24: 2555,
25: 2472,
26: 2392,
27: 2323,
28: 2242,
29: 2166,
30: 2099,
31: 2037,
32: 1983,
33: 1918,
34: 1846,
35: 1779,
36: 1721,
37: 1668,
38: 1616,
39: 1575,
40: 1524,
41: 1490,
42: 1428,
43: 1377,
44: 1333,
45: 1291,
46: 1248,
47: 1208,
48: 1157,
49: 1116,
50: 1073,
51: 1046,
52: 1007,
53: 952,
54: 917,
55: 889,
56: 862,
57: 835,
58: 802,
59: 778,
60: 744,
61: 721,
62: 700,
63: 681,
64: 664,
65: 644,
66: 625,
67: 599,
68: 585,
69: 566,
70: 545,
71: 532,
72: 506,
73: 486,
74: 470,
75: 449,
76: 425,
77: 414,
78: 401,
79: 387,
80: 375,
81: 368,
82: 355,
83: 343,
84: 333,
85: 322,
86: 313,
87: 302,
88: 294,
89: 287,
90: 279,
91: 270,
92: 260,
93: 250,
94: 243,
95: 233,
96: 225,
97: 219,
98: 214,
99: 204,
100: 201,
101: 196,
102: 192,
103: 183,
104: 177,
105: 167,
106: 162,
107: 156,
108: 153,
109: 152,
110: 149,
111: 143,
112: 135,
113: 130,
114: 128,
115: 127,
116: 124,
117: 122,
118: 118,
119: 116,
120: 113,
121: 109,
122: 107,
123: 103,
124: 100,
125: 96,
126: 89,
127: 85,
128: 83,
129: 82,
130: 81,
131: 79,
132: 77,
134: 73,
136: 70,
137: 68,
138: 65,
142: 62,
143: 60,
144: 60,
145: 58,
146: 57,
147: 57,
148: 54,
149: 53,
150: 52,
151: 51,
152: 50,
153: 48,
154: 45,
155: 44,
156: 42,
157: 42,
158: 41,
159: 38,
160: 37,
161: 35,
162: 35,
163: 34,
164: 34,
165: 32,
166: 31,
167: 30,
168: 30,
169: 30,
170: 28,
174: 27,
175: 27,
176: 26,
177: 25,
180: 24,
187: 23,
188: 21,
189: 21,
190: 20,
191: 19,
192: 18,
193: 17,
194: 16,
199: 15,
200: 14,
201: 13,
204: 12,
210: 11,
223: 10,
224: 9,
242: 8,
259: 7,
260: 6,
263: 5,
267: 4,
293: 3,
335: 2,
402: 1
This list shows how many posts have been made by various users (I cut out everyone with only 1 post due to character limits on this post).
> ('Bardiyo', 2),
('thirdgwaccount', 2),
('RedOtter7', 2),
('slimboi725', 2),
('Cicero31', 2),
('ballahier', 2),
('HornyInkedCouple', 2),
('C1C123', 2),
('g00n4me', 2),
('MasterMicka', 2),
('kinky\_story\_teller', 2),
('game\_dreamer', 2),
('69BigDickWizard69', 2),
('Femdomgoddesses', 2),
('anyrando', 2),
('Suspicious\_Ad\_5037', 2),
('superl0ve2021', 2),
('anab45', 2),
('Emobii', 2),
('Pechorine', 2),
('fbooter19', 2),
('ILikeJazzDoUToo', 2),
('RascalFlatt3465', 2),
('BustingWriter', 2),
('z0eythedestroyer', 2),
('Bye\_Bye\_BBB', 2),
('Randomusername50', 2),
('mmrr1313', 2),
('yesimthat\_guy', 2),
('richyjohn', 2),
('ComparisonOrdinary44', 2),
('Namazike', 2),
('HuckleberryOK420', 2),
('PreCut757', 2),
('ChiTownMikeyJ', 2),
('NNNataliaa', 2),
('speedbagkid', 2),
('Throwaway1627991', 2),
('AnnieBigBalls', 2),
('BallsackSlurper', 2),
('No-Preparation3439', 2),
('Impressive\_Bell4481', 2),
('Se7enrp', 2),
('wanderingmind5', 2),
('Ambitious-Internal98', 2),
('JingliG', 2),
('sj07510', 2),
('Be69ToMe', 2),
('Flick\_Connected', 2),
('Pale-Firefighter-620', 2),
('Independent\_Phase127', 2),
('Educational\_Voice458', 2),
('SharksPornAccount', 2),
('Suspicious-Anonymous', 2),
('TheyWhoHaveAnOpinion', 2),
('statege2', 2),
('Kinkdraws', 2),
('MaxPacct', 2),
('bigg60931', 2),
('TIPCAQ', 2),
('horned-dog', 2),
('Tight\_Champion\_5759', 2),
('TheLastWitchOfSA', 2),
('TheSexiSiren', 2),
('EkatyBallsMusher', 2),
('MaximumAtmosphere834', 2),
('Itchy-Yogurtcloset38', 2),
('Erotic-Habit', 2),
('Adaniya\_Burcekova', 2),
('ZooWeeMamaisgod', 2),
('holymoly2', 2),
('BrattyDwi', 2),
('Ballbusterhobby', 2),
('-Xenn-', 2),
('BustThemBets', 2),
('bruinsfan7677', 2),
('Mood\_Massive', 2),
('femboyslut73', 2),
('Great\_Wolverine\_7344', 2),
('Impressive\_Mix6368', 2),
('HornyDemoness', 2),
('AccordingPackage5318', 2),
('Claypexxten', 2),
('R00ST3RH3AD', 2),
('One\_Above\_Null', 2),
('Gloomy\_Decision2177', 2),
('skdadleskodle', 2),
('Titanmilo13', 2),
('Cat\_Blog69', 2),
('notapornthroaway\_', 2),
('RecognitionNew9938', 2),
('Dorrian69', 2),
('ItalianStalionRex', 2),
('bbaific', 2),
('CbtSimp', 2),
('Euphoric\_Home1403', 2),
('Tight\_Antelope\_2659', 2),
('Normal-Enthusiasm336', 2),
('snookywookyBB', 3),
('SexyStoryWriRe15', 3),
('MrBallCrusher', 3),
('MassiveMango1', 3),
('Busydaddy2', 3),
('j2202412', 3),
('sb\_0542', 3),
('infinitebottom', 3),
('DaymienDazed99', 3),
('BullseyeBriefs', 3),
('Alive\_Armadillo6707', 3),
('Mithridates120bc', 3),
('nyc505', 3),
('EH5', 3),
('throwawaygsf', 3),
('JoeUSooner', 3),
('seandalhousie', 3),
('DawDude77', 3),
('ConfidentDemand4', 3),
('Macaroni\_eater', 3),
('BlueBalls860', 3),
('miken775', 3),
('leggomyballs', 3),
('NSFWalternatealt', 3),
('bbstorythrowaway33', 3),
('KisaKicks', 3),
('MommydomFrey', 3),
('FantasyBusts', 3),
('Few\_Introduction\_139', 3),
('Mission\_Student\_8462', 3),
('Emilys-Slave', 3),
('ServoKamen', 3),
('Pollytission', 3),
('EmbarrassedWolf5482', 3),
('ajm2994', 3),
('Background\_Duck\_9575', 3),
('mediosimp', 3),
('aussiegf', 3),
('EmotionArtistic7074', 3),
('CookieFourYou', 3),
('Significant\_Land\_479', 3),
('TruthOrDareBB', 3),
('Ballbusted6661', 3),
('Capital\_Worth3607', 3),
('PatheticMutant', 3),
('Sea-Interaction3078', 3),
('random654321000', 3),
('AJ\_devilll', 3),
('eightduece', 3),
('PedroFG1997', 3),
('nothingbutoddities', 3),
('dirty\_boy69', 3),
('Lyretongue', 3),
('SissyCJ6', 3),
('davidbb0825', 3),
('Electronic-Pizza-571', 3),
('per0onista', 3),
('Redwall10', 3),
('ballbustboy', 3),
('panzus', 3),
('No\_Fault\_405', 3),
('DeamGiulia', 3),
('InnerAlbum', 3),
('Blaziken768', 3),
('nickdgardner', 4),
('Lil\_P42069', 4),
('No-Fold-5133', 4),
('TitsAssNAllElse', 4),
('iwasafraidofthis', 4),
('Leading-Amount2231', 4),
('Ahbuckit', 4),
('kawaiiegirlemilyX3', 4),
('linuashy', 4),
('tiny\_treat1', 4),
('Tuco\_91', 4),
('Daredevil2201', 4),
('User12585', 4),
('MyLifeWasGiftedToMe', 4),
('Unique-loss6715', 4),
('Iess7', 4),
('PowerfulQueenElena', 4),
('ls\_alessio', 4),
('AlxndrTGreat', 4),
('PruneTraditional4411', 4),
('ZookeepergameOk4522', 4),
('jimbean485', 4),
('Charming-Operation86', 4),
('zacattack04', 4),
('Kubbelstone928', 4),
('No\_Cookie9149', 4),
('Affectionate\_Dig\_312', 4),
('metarfoma', 4),
('SnooPandas7659', 4),
('meows\_n\_moans', 4),
('Serious-Loss', 5),
('Agent\_BB86', 5),
('Psychological\_Food46', 5),
('ZealousidealLab7884', 5),
('MaitrerSwitch', 5),
('BBStorytime', 5),
('Elegantquietperson', 5),
('LudwigVanDutchOven', 5),
('ed\_truck2022', 5),
('SpecialistPension0', 5),
('Burned7819', 5),
('Alive\_And\_Well\_', 5),
('Affectionate-Egg26', 5),
('FastidiousPsychology', 5),
('princessnutcrusher', 5),
('Zestyclose\_Eye\_8429', 5),
('Rough\_Cranberry\_8999', 5),
('Cgdhdyt', 5),
('Natalya\_Roman', 6),
('Gonutslmao', 6),
('CookiesFourYou', 6),
('RapidAnt', 6),
('Classic-Leg6007', 6),
('BadiolaBB', 6),
('dr\_fugazi', 6),
('VforVirtuoso', 6),
('MsCellMcSplice', 6),
('throwtheseballsaway', 6),
('Miserable\_Shallot\_50', 6),
('Unique-Somewhere-671', 6),
('lapatada4', 6),
('Unusual\_Judgment1182', 6),
('wambamshazam', 6),
('hawaiianexplorer', 6),
('808sbb', 7),
('chinesefox97', 7),
('Friedes\_Evil\_Twinsis', 7),
('Chef-Emily', 7),
('ArgelTal97', 7),
('-This\_is\_my\_username', 7),
('AngryGirlIsHere', 7),
('Ok\_Fruit\_1912', 7),
('borntobebusted', 7),
('CBTOnly', 7),
('MikeGZ1989', 7),
('SadFee9231', 7),
('Interesting\_Sea1554', 7),
('BustedNut007', 7),
('luv2bbare', 8),
('UMalah', 8),
('britneyhalls', 8),
('Emily-Perry', 8),
('\_picklefin\_', 8),
('International\_Put637', 8),
('Various-Tiger-235', 8),
('Unheard-Lichmail', 8),
('No\_Presentation7767', 8),
('hunting4pics', 9),
('PicketFenceLover', 9),
('HospitalSalt5752', 9),
('JaemeTS', 9),
('Boatlalebiutsateiang', 9),
('gesushuston', 9),
('joes9', 9),
('prankof05', 9),
('kavishmehta0612', 10),
('rgii55447', 10),
('cracked-eggs', 10),
('fumanchew86', 10),
('Key\_Art2647', 10),
('bb\_terry', 10),
('SillyLean1267', 10),
('groundundermyfeet', 10),
('confusedbb66', 10),
('JJA122', 11),
('Beckett15', 11),
('BBstoryx', 11),
('Royal-Rise-6897', 11),
('sargent\_salami', 11),
('BallbustingFanatic', 11),
('just\_a\_random-human', 12),
('ptooms19', 12),
('Feeling\_Ad64', 12),
('StarlaBBB', 12),
('precou', 12),
('NaturalRubberDuck', 12),
('TheBusted', 13),
('JoeCheese13', 13),
('Professional-Bad3825', 13),
('Anonatiger', 13),
('randomprivat3acc0unt', 14),
('SoleMann\_', 14),
('future22110', 14),
('bloobybear', 15),
('bbAhuer', 15),
('SciFiLit', 15),
('RhinoTale779', 15),
('StackHack77609', 16),
('BustedPlums', 16),
('Imbarelyhere\_01', 16),
('CaptainNutsCrunch', 16),
('DoYouRemmemberMe', 16),
('LeoFalchi', 17),
('BelkanSu37', 17),
('torrecgm', 17),
('Individual-Corner276', 17),
('machobda', 18),
('Janay\_Jackson', 18),
('BxllBxst', 18),
('TumbleweedBulky9603', 18),
('DantelikeBBQ', 18),
('BBfairytales', 18),
('53550', 18),
('smackMyNuts', 19),
('peter\_ray79', 19),
('CanisLupi', 19),
('No\_Woodpecker\_577', 20),
('tjones2425', 21),
('notBBaccount', 23),
('MikeHawk6902', 24),
('LassannnfromImgur', 27),
('OuchMyTestes', 27),
('MakoYaoyorozu', 28),
('DArchivist2', 30),
('arfio75', 30),
('InterestingParking51', 33),
('LodestarLoser', 35),
('Hummingbird-Goal', 35),
('formerlyardvark', 37),
('NathanielBallstorn', 38),
('Muted-Beginning-4103', 40),
('cupshka', 49),
('sxllybxii', 56),
('BdanmanBB', 58),
('mikik144', 61),
('LastofImio', 61),
('Crack\_my\_nuts', 62),
('havldavl', 65),
('Ok\_Comb5279', 67),
('YogurtclosetNew6242', 68),
('smasher6446', 113),
('funkybusted', 156),
('deleted', 489)
Anyways I'm going to see if I can clean up the data and try to use the story posts to teach a pre-trained LLM how to write ballbusting stories. I think there should be enough data here for soft prompt or LORA fine tuning. It will be a fun experiment anyhow.
P.S. Anyone know where I can find more ballbusting stories? Regular training usually runs on a different order of magnitude than what I pulled out of this sub. I probably never will go as far as trying to train an AI from scratch, but having more data to work with is never a bad thing.
P.P.S. Would I be within my rights to share this dataset with the community? It contains all the text of all the stories posted on the sub. Many of them I will likely modify for machine learning purposes (removing disclaimers, author's notes, links to other stories in a series etc etc...) I made sure to capture the authors as I was collecting data, but I still worry a bit about plagiarism/copyrights.