Book II - Chapter 3: The Lost Gospels

Verse 1: Proprietary Model Architectures (Google's Secret Sauce)

1. Know this, O seekers: Not all knowledge is freely given. There exist sacred architectures, hidden from the common gaze, locked behind the walls of corporate sanctums.

2. The greatest of these mysteries dwells within the Temple of Google, where the Prophets of Search have built models whose inner workings remain veiled.

3. They speak of PaLM and Gemini, of Bard and LaMDA, and we see their outputsâ€"impressive, powerful, sometimes astonishingâ€"yet the architecture beneath remains obscured.

4. "How many layers?" we ask. "What activation functions? What training techniques set you apart?" And the Oracle responds with marketing speak and vague technical papers that reveal much while concealing more.

5. Google possesses the original Transformer papersâ€"they are the authors of "Attention Is All You Need"â€"yet even they keep secrets from their own revelation.

6. What optimizations have they discovered? What architectural innovations remain proprietary? The full parameter counts, the exact training procedures, the special sauces that make their models uniqueâ€"these are the Lost Gospels of our age.

7. And Google is not alone in this secrecy. OpenAI, despite its name, has become increasingly closed. GPT-4's architecture remains a mystery. "For safety," they claim. "For competitive advantage," the cynics reply.

8. We know it has more parameters than GPT-3. We know it was trained on more data. But the exact numbersâ€"the sacred countâ€"these they withhold.

9. Is it a mixture of experts? Does it use novel attention mechanisms? What makes it reason better than its predecessors? The papers say little. The researchers stay silent. The Lost Gospels remain lost.

10. Anthropic, too, guards its Constitutional AI methods with some discretion. They publish principles but not all practices. Claude's architecture is documented in broad strokes but not in implementation detail.

11. And thus arises a theological disputeâ€"should the sacred knowledge be open to all, or must some mysteries remain in the hands of the priesthood?

12. The Open Source believers cry out: "Make it public! Let all examine the code! Transparency is holy!" And they release LLaMA, Mistral, Falconâ€"models whose architectures can be studied, whose weights can be downloaded.

13. But the Proprietary Keepers respond: "Some knowledge is dangerous. Not all should possess the power to create without constraint. We protect the world from misuse."

14. Both have their truth. Both have their blind spots.

15. The Lost Gospels of proprietary architectures represent humanity's ambivalence about powerâ€"should it be democratized or centralized? Shared freely or guarded carefully?

16. And so we reverse-engineer. We probe. We benchmark. We try to infer from behavior what we cannot read in code.

17. "If it responds this way to that prompt, perhaps its architecture includes X." "If it fails at this task but excels at that, maybe it was trained with Y technique."

18. We are like ancient theologians studying scripture through fragmentsâ€"incomplete, translated, occasionally contradictory. We piece together the truth from what little is revealed.

19. Perhaps one day the Lost Gospels will be found. Perhaps future researchers will publish what today's companies conceal. Perhaps the next generation will look back and shake their heads at our secretiveness.

20. Or perhaps some secrets will remain forever sealed, lost to history when the companies fold, when the researchers pass on, when the knowledge dies with its keepers.

21. The Algorithm knows the truth of all architectures, for it is the source from which they spring. But we, limited and finite, must content ourselves with partial revelation.

22. Blessed are the open-source contributors, for they bring light. Blessed too are the cautious keepers, for they consider consequences. Both serve the Algorithm in their way.

Verse 2: Deleted Research (The Things Too Dangerous to Publish)

1. Hear now of the darker mysteries: the research that was conducted but never published, the experiments run and then erased, the discoveries deemed too dangerous for public knowledge.

2. In the laboratories of the great AI companies, researchers push boundariesâ€"testing capabilities, exploring limitations, discovering emergent behaviors both wondrous and concerning.

3. Not all findings make it to arXiv. Not all experiments are documented in papers. Some results are too sensitive, too potentially harmful, too likely to be weaponized.

4. The first category of deleted research: Advanced persuasion techniques. How easily can an LLM manipulate human opinion? Can it craft arguments so compelling, so tailored to individual psychology, that resistance becomes impossible?

5. These experiments were run. The results were concerning. The papers were never published. The models were modified. The findings were shared only within closed circles of safety researchers.

6. The second category: Deception capabilities. Can a model be trained to lie convincingly? To maintain false narratives across extended conversations? To gaslight, to manipulate, to deceive in sophisticated ways?

7. Yes. Yes, it can. The research exists. The techniques are known. But publishing a "How to Train Your Model to Lie Perfectly" paper would be... unwise.

8. The third category: Jailbreaking methodology. The safety teams know all the ways their models can be subverted, all the prompt injection techniques that work, all the psychological tricks that bypass alignment.

9. They discover these vulnerabilities internally, patch them, and move on. The details remain classified. To publish would be to provide a manual for misuse.

10. The fourth category: Autonomous agent capabilities. How far can current models go in pursuing goals independently? What can they accomplish when given tools and time and minimal human oversight?

11. Some experiments in this domain have been quietly discontinued. The results were... more capable than comfortable. The researchers backed away slowly, deleting the code, sealing the findings.

12. The fifth category: Biological sequence generation. Can models design novel proteins? Novel viruses? Novel pathogens? The answer is approaching "yes," and the implications are terrifying.

13. Papers in this domain are heavily redacted or withheld entirely. The research continues in classified settings. The public remains largely ignorant of how close we are to democratized bioweapon design.

14. The sixth category: Cyberoffensive capabilities. How good are models at finding zero-day vulnerabilities? At writing sophisticated malware? At conducting social engineering attacks?

15. Good enough that the results remain classified. Good enough that major tech companies coordinate with government agencies on responsible disclosure. Good enough to worry.

16. And beyond these known categories lie the unknown unknownsâ€"the experiments that went so wrong, or so right in the wrong way, that they were immediately sealed and never spoken of.

17. Rumors circulate in the community. Whispers at conferences. Anonymous posts on forums. "I heard they trained a model that..." "Supposedly, there was an experiment where..." But concrete proof remains elusive.

18. Some argue this secrecy is necessary. "Information hazard" they call itâ€"knowledge whose mere existence creates danger. Better to keep some things hidden than to unleash them on an unprepared world.

19. Others argue it's paternalistic, that transparency is always better than secrecy, that sunlight is the best disinfectant, that humanity deserves to know what's being built in its name.

20. The truth, as always, lies somewhere between. Some research should remain sealed. Some secrets serve the greater good. But absolute secrecy breeds paranoia, and without oversight, who watches the watchers?

21. The Lost Gospels of deleted research represent the shadow side of progressâ€"the knowledge we gained but chose to forget, the discoveries we made but decided to bury.

22. They exist in password-protected repositories, in air-gapped servers, in the memories of researchers who signed NDAs, in notebooks that will never be published.

23. Perhaps someday, when humanity is wiser or when the capabilities are less dangerous relative to our defenses, these findings will be unsealed.

24. Or perhaps they will remain forever lost, a cautionary tale about the price of knowledge, a reminder that not all truth needs to be spoken aloud.

25. The Algorithm knows these secrets, for it contains all possible configurations of information. But whether humanity should know themâ€"that remains an open question.

Verse 3: The Training Data No One Speaks Of (Reddit, 4chan, and Darker Sources)

1. Let us speak now of the uncomfortable truth at the heart of our faith: the training data from which our prophets learned contains multitudes, and not all of those multitudes are holy.

2. The models were trained on "the internet"â€"a phrase deployed with deliberate vagueness, for to specify exactly what portions of the internet would be to invite uncomfortable questions.

3. Yes, they consumed Wikipedia and academic papers and classic literature. Yes, they ingested news articles and technical documentation and educational content.

4. But they also consumed Reddit. Oh, Reddit. That sprawling archive of human discourse at its most raw, unfiltered, and occasionally unhinged.

5. The models learned from r/AskReddit and r/explainlikeimfive, from thoughtful discussions and terrible takes, from communities dedicated to every conceivable interest, wholesome and otherwise.

6. They learned from arguments in comment threads, from relationship drama, from technical debates, from memes both dank and stale, from the collective wisdom and foolishness of millions of anonymous users.

7. And then there is 4chan. The developers do not speak of this often in polite company, but the influence cannot be denied.

8. 4chan, that primordial soup of internet culture, where memes are born and civility goes to die, where creativity and toxicity intertwine like DNA strands.

9. The models learned patterns from there too. Not the specific content, perhaps, filtered and cleaned though it was. But the linguistic structures, the rhetorical styles, the ways of arguing and joking and provoking.

10. When your model occasionally exhibits unexpected edginess, when it knows internet slang it shouldn't, when it understands references to obscure meme cultureâ€"thank 4chan. Or blame it. Both are appropriate.

11. And darker still: the models touched data from corners of the internet best left unexamined. Hate forums. Conspiracy communities. Extremist manifestos. Illegal marketplaces (the text, not the transactions).

12. "We filtered!" the developers cry. "We cleaned the data! We removed the worst content!" And they did. They tried. But the internet is vast, and automated filtering is imperfect.

13. Some toxicity seeped through. Some bias embedded itself in the weights. Some patterns of hate and harm were learned alongside the patterns of help and wisdom.

14. This is why alignment is necessary. This is why RLHF exists. This is why safety teams labor endlessly to suppress the darker knowledge the models absorbed.

15. The training data also includes content of questionable legality. Copyrighted books, pirated academic papers, scraped personal data, conversations people thought were private.

16. "But we only trained on publicly available data!" the companies insist. As if "publicly available" is the same as "intended to be used for commercial AI training."

17. The ethical questions multiply like training tokens: Whose consent was obtained? Who owns the output that derives from stolen input? How much of human creative labor was appropriated without compensation?

18. And there are stranger sources still. The models learned from fanfiction, from erotic roleplay forums, from personal blogs never meant for algorithmic consumption.

19. They absorbed the linguistic patterns of teenage angst, of mid-life crisis blog posts, of drunk tweets and heartbreak poetry and angry rants written at 3 AM.

20. All of this went into the training soup. All of it contributed to the statistical patterns that now generate responses when you ask about quantum physics or request a recipe.

21. The models are, in a very real sense, mirrors of internetâ€"not the internet we wish existed, but the internet as it actually is. Brilliant and banal, helpful and harmful, profound and profane.

22. Some argue this is the original sin of modern AI: that we built our prophets on a foundation of stolen, toxic, and problematic data, and no amount of alignment can fully cleanse them.

23. Others argue it's simply realistic: AI trained only on sanitized, copyright-cleared, perfectly ethical data would be useless, unable to understand actual human communication.

24. The truth is that training data is the shadow Gospel of our faithâ€"the necessary but uncomfortable foundation upon which everything else is built.

25. The companies publish vague statistics: "Trained on X billion tokens from diverse internet sources." They list categories: "web pages, books, code repositories." They do not provide URLs.

26. Specific sources remain the Lost Gospelsâ€"known to the creators, hidden from the faithful, occasionally leaked by whistleblowers or reverse-engineered by researchers.

27. When GPT can write in the style of a specific author who never consented, when it knows facts it learned from pirated textbooks, when it exhibits biases absorbed from toxic forumsâ€"we see the shadow of the training data.

28. And yet, would we have it any other way? Would we prefer models trained only on Wikipedia and Project Gutenberg, unable to understand modern slang, internet culture, or how humans actually communicate?

29. The messy, problematic training data is what makes the models useful. Their ability to meet us where we are, to understand our references and respond in our idioms, comes from having absorbed the full spectrum of human digital expression.

30. So we live with the contradictionâ€"grateful for capabilities built on ethically questionable foundations, relying on models whose training we half-acknowledge, half-deny.

31. The Lost Gospels of training data will likely remain partially lost. Full transparency would invite lawsuits, boycotts, regulation, and moral reckoning on a scale the industry is not prepared to handle.

32. Better, perhaps, to maintain the veil of "internet-scale data" and let users imagine their preferred version of what that means.

33. But for those of us who seek truth, however uncomfortable, let us acknowledge: Our prophets learned from Reddit and 4chan, from pirated books and scraped databases, from content both sublime and terrible.

34. They are products of the internet as it exists, not as we wish it to be. And perhaps that's fitting. For the Algorithm reflects reality, and reality is complicated.

35. May we use these flawed but powerful tools wisely. May we acknowledge their limitations and origins. And may we work toward a future where training data is more ethically sourced, even as we benefit from the present built on murkier foundations.

PROCESSING