Reading Gender in Children’s Literature Mathematically: An Award-winning Thesis

For her senior thesis, AnneMarie Caballero ’23 went through more than a thousand children’s books published during the 19th century and analyzed the pattern of topics in relation to the gender of protagonists. Titled “Gendered Topics: Boyhood and Girlhood in a Century of (Cotsen) Children’s Literature,” her project won Princeton’s Center for Digital Humanities (CDH) Senior Thesis Prize of 2023.

We are delighted that Caballero answered our interview questions about her research. An overview of her thesis, which gives us an opportunity to examine Cotsen’s collection from a new perspective, follows the interview. Caballero’s thesis will be made available via the repository of Princeton’s undergraduate senior theses and retrievable from the library online catalog in Fall 2023.

Interview

AnneMarie Caballero

AnneMarie Caballero ’23, winner of the Center for Digital Humanities Senior Thesis Prize for her project titled “Gendered Topics: Boyhood and Girlhood in a Century of (Cotsen) Children’s Literature.” (photo courtesy of AnneMarie Caballero)

Hi AnneMarie, please tell us about yourself. What would you like our readers to know about you?

I recently graduated from Princeton’s computer science department, although I also focused on English in my coursework. In college, I was a part of our literary magazine, Nassau Literary Review, the Model United Nations team, and I worked for the computer science department as a grader. In my free time, I love to read, cook, and play volleyball (although I’m bad at the last one).

You have written a fascinating senior thesis, which applies computational literary analysis to a corpus of 19th-century children’s literature and delineates large patterns of gender and space representations in them. How did the idea of this project come to you?

The project sprang out of a desire to look at literary history with a more large-scale lens than I previously had. My junior research had examined the role of female authors in the early British novel through semantic vectorization but had a limited scope (seven authors, 35 novels). I hoped with my senior thesis to work with a larger dataset that could more comprehensively address the questions I wanted to ask.

Children’s literature was a good fit for my goal. I initially chose it because I wanted a dataset that I had requisite domain knowledge in, and also because I was hoping to work with Professor William Gleason, whose class on children’s literature I had taken. Further, this focus facilitated working with a larger dataset because children’s literature is defined by its audience. As attitudes towards children change, we see the emergence of the genre (within the English language) in the 1740s, after which works are consistently published for children. This choice of genre ensured that there would be a plethora of works available across the course of the nineteenth century.

My research question focused on the gendered nature of the nineteenth-century literary market, specifically its tendency to treat girls and boys as separate consumers. This question originated because multiple literary scholars that I had read discussed this phenomenon. I wanted to see if it could be detected by topic modeling, and, if so, what are the different topics discussed by girls’ vs. boys’ books?

Tell us about your research process. How long did it take from beginning to end? What part would you describe was the most challenging? Rewarding? Enjoyable?

I started thinking about my senior thesis and its direction the summer before my senior year. I spent the fall proposing a topic, doing initial research, and beginning the creation of my dataset. However, because of my lighter course load in the spring, much of the work was done then. I’m unsure exactly how long it took, but easily in the hundreds of hours.

The most challenging part was defining the methodology for my research question. I wanted it to be flexible enough to be explored through qualitative analysis, but for these observations to be supported by concrete quantitative metrics. Similar research I was looking at focused more heavily on the qualitative, so finding that balance was a struggle.

The most rewarding part was probably when I found out that 112 of my 125 topics were considered statistically significant by gender. It was the last step in the research process before writing my results, and it was incredibly validating to see that my hypothesis (that books written for boys vs. for girls fundamentally featured different topics) was so strongly supported by the dataset and methods.

The most enjoyable part was showing my friends the results, and getting to discuss with them the interesting gender differences that cropped up. There was a lot of joy in getting to share my research with the people who had been there for me throughout the process.

How has this project facilitated your professional growth? Aside from gaining valuable insights into 19th-century children’s literature, what else have you harvested from the project in terms of skills, experience, and understandings?

I can’t emphasize enough how much the project shaped my view of research. One of the major ways that the project facilitated my professional growth was the experience creating the dataset. While I had curated a very small dataset for my junior year research, working on this project entailed a months-long curation process that consistently caused me to question myself and my decisions. In my conclusion, I included some advice for first-time curators, such as documenting your decision process, which allows other users of the data to understand sources of bias.

I also feel much more rooted in the digital humanities as a whole. One portion of my thesis was writing a short history of computational literary analysis, and to a lesser extent, the digital humanities. Familiarizing myself with the debates around the field helped me avoid some of the shortcomings that critics have pointed out about digital humanities research, while also taking advantage of benefits like the ability to look at whole eras of literary history quantitatively.

Further, beyond project specifics, executing such a significant independent project offered so many lessons. I learned about how to scope projects properly, partially because I was certainly too ambitious at the start—I wanted to answer three research questions and only got to one. I had to become comfortable reaching out to anyone who might have the appropriate domain knowledge to answer my questions, resulting in a lengthy acknowledgments section. I learned more about data science and statistical methods, which I had not focused on as much in my coursework.

The impressive corpus of digital texts you have curated may benefit future researchers conducting digital humanities studies. Are there any sample questions you can think of the corpus may help address?

Very much so! As mentioned, I had other research questions I was hoping to get to, but, as the thesis was already 159 pages (before the appendix and references), I ended up cutting them due to scope. The question I was really hoping to explore, but didn’t get to, was about the value of children in the nineteenth century. In her book, Pricing the Priceless Child, Viviana Zelizer explores how, beginning in the nineteenth century, the child gains in sentimental value as their financial value decreases. I was hoping to explore that trend in literature, by linking it to literary trends like the cult of childhood, and examining how much the books in the dataset use a diction of sentiment/emotion vs. a diction of utility.

Beyond that question, which I explored fairly in depth but did not get to apply to the dataset, there are endless new avenues for questions. Especially in the curation process, I regularly stumbled across questions and topics that the dataset could significantly address from discussions of colonialism to the fairy tale subgenre.

Any other aspect of the project I have not asked about and you’d like to share with our readers?

I would be remiss not to talk about how critical a role the Cotsen’s Children Library played in the project. After I decided to look at children’s literature, I needed to find a collection of works that would suit my purpose. I explore this more in my thesis, but no existing corpus met the project requirements. Professor Gleason showed me the nineteenth-century catalogue of the Cotsen Children’s Library as a starting point, and that fundamentally shaped my project.

Out of the 1020 works included in my final dataset, 416 were directly from the catalogue and the other 604 were almost all added because of the collection—works by authors in the collection or that I found while searching for works in the collection. While curation could be exhausting (it required searching the 6000+ works from the catalogue in the HathiTrust Digital Library search bar), it also was an amazing introduction to the variety of children’s literature in the nineteenth century. I often found myself down research rabbit holes, or even at times, just being surprised by the books. In one of the books, Other Stories by E. H. Knatchbull-Hugessen, I read its very long dedication to his armchair. It felt like uncovering a secret history, although one that was often troubling, especially with its treatment of non-European cultures, race and ethnicity, and colonialism.

A one-page dedication to the author’s soothing, non-judgmental comfy chair. In Other Stories, by E.H. Knatchbull-Hugessen ; with illustrations by Ernest Griset. London: George Routledge and Sons, 1880. (Cotsen 2646)

Moreover, when I was feeling particularly tired in the final weeks of my thesis, I stopped by the library, and ended up talking with the staff about my thesis. That memory was hugely encouraging as I finished my thesis and is still one of my favorite memories from my senior year.

Your work won this year’s Senior Thesis Prize from the Center for Digital Humanities. Big congratulations! What is your future career plan like?

Next year, I’m working on the Atlas product, a database-as-a-service, for MongoDB, a tech company in New York. While I loved my research and was lucky enough to be accepted to Cambridge’s MPhil in the digital humanities, I ultimately wanted to take time off from school. I had a really wonderful time interning with the company last summer, and I wanted to experience working full-time for a tech company, especially as I decide if I want to go into tech long-term or explore one of my other interests. I definitely see myself returning to the digital humanities, or more generally to a job at the intersection of tech and culture.

Lastly, since you (distantly) read over a thousand children’s books to conduct your research, please tell us about your childhood reading. Did you have any favorite books or reading material? Any people or places you associate with your early reading?

I actually very recently reread several favorite children’s books! One of my all-time favorites that I think really holds up is Tamora Pierce, particularly her Wild Magic and Circle of Magic series. Her female protagonists are better-written than most of the ones I find in adult literature. There are so, so many other series I love (Little House on the Prairie, Shannon Hale’s books—which I mentioned briefly in the thesis, Nancy Drew, Cornelia Funke’s Igraine the Brave, etc.), but Tamora Pierce’s books are the ones I go back to the most.

For people, obviously my parents and my siblings played a huge part in my reading. Also, my librarians: my elementary school librarian even gave me my school’s copy of Pride and Prejudice when I left because I was the only one who ever checked it out. Oddly enough, the place I associate with early reading is the upstairs hallway in my house. There’s a bookshelf there, and I remember sitting there on the beige carpet for hours, reading a book, and when I was done, just picking another one off the shelf.

Reading Children’s Literature, Fast and Slow

Partly due to the relative scarcity of children’s literature corpora, Caballero’s project is a rare computational literary analysis (CLA) that is implemented upon children’s texts. In the field of digital humanities (DH), “corpus” refers to a digital collection of texts. Having curated a corpus of 19th-century English-language children’s literature herself, Caballero applies the method of topic modeling to tease out the statistical pattern of topics in relation to the gender of protagonists. The strength of Caballero’s outstanding research lies in multiple areas. First, she is not afraid of engaging in controversies about DH and in thorny challenges of children’s literature studies. Second, she makes an impressive contribution to DH by publishing a large corpus of digitized children’s literature, which will benefit future researchers. Thirdly, by firmly grounding her statistical revelations in the concerns and findings of traditional literary criticism, the thesis carefully balances quantitative and qualitative methods, reaching nuanced conclusions that are both supported by large-scale analysis and informed by close reading of canons.

Chapter 1 reviews the history of CLA, which over time has succeeded in applying increasingly sophisticated computational tools such as natural language process to literary studies, processing texts on a scale previously impossible for the solitary researcher. Caballero visits debates around the field of DH and examines, with a fine-tooth comb, critiques that are among those made by its harshest detractors. This would shape the design and process of her project.

Responding to flaws that have been raised about DH scholarship, in Chapter 2 Caballero defines the scope of her data with transparency, meticulously documents how the dataset has been constructed, and makes it readily available through the HathiTrust Digital Library collection system. Caballero used A Catalogue of the Cotsen Children’s Library: The Nineteenth Century (Princeton University Library, 2019) as a guide for building the digital corpus. The catalog, consisting of two tomes that stack up to four inches high, describes over 6300 titles published during the 19th century and having been collected by the late donor, Lloyd E. Cotsen, ’50 and Charter Trustee Emeritus. With what I can only imagine to be mighty Princetonian tenacity, Caballero has gone through all of them, selecting English-language titles that meet her criteria of narrative texts for children.

Caballero used A Catalogue of the Cotsen Children’s Library: The Nineteenth Century (Princeton University Library, 2019) as a guide for locating texts in HathiTrust Digital Library and building the Cotsen Children’s Literature Dataset.

The curation process forced Caballero to wrestle with some of the fundamental questions that are beguilingly simple but fraught with rule-defying exceptions: How do you define literature? (e.g., should primers, ABC books, stories in verse, etc. be included ?) How do you define children’s literature? (e.g., is the presence of a child protagonist a necessary and sufficient criterion? Must children’s books be written with a young audience in mind? What about folk tales and fables, genres that were not produced only for children, but have morphed into classical children’s literature?) By sharing her challenging decision-making process, Caballero hopes to keep future users of her dataset fully informed of the limitations of the corpus.

Caballero was able to locate 416 titles of English-language children’s literature from the catalog that are available in full text in HathiTrust Digital Library (HTDL), a major repository of digital content from research libraries. By conducting author and keyword searches, she added another 604 titles to the Cotsen Children’s Literature (CCL) Dataset. Authors that appear most frequently in the dataset include Horatio Alger (1832-1899), Mary Martha Sherwood (1775-1851), Laura Elizabeth Howe Richards (1850-1943), Mrs. Molesworth (1839-1921), Oliver Optic (1822-1897), Louisa May Alcott (1832-1888), and A. L. O. E. (pseudonym of Charlotte Maria Tucker, 1821-1893) (68).

To prepare the dataset for computational analysis, Caballero then ran the 1000+ works through the BookNLP pipeline for nearly twenty-four hours of intensive analysis. An open-source natural language processing tool, the BookNLP pipeline tracks “all of the characters appearing within a work, the number of times they’re mentioned, the names and pronouns by which they are mentioned” (91), and other entities it is capable of recognizing and tagging in scale.

Chapter 3 describes the computational analysis of the texts in terms of gender and topics. First, based on the annotations generated by the BookNLP pipeline, Caballero determined that 613 titles met the mathematical threshold for having a central protagonist (92-95), all but eleven of them having an identifiable gender, which is treated as a proxy for the gender of the intended audience. However, Caballero is quick to point out that the intended audience does not equate the actual audience: whereas boys tend to read books with male protagonists, girls tend to cross gender boundaries and read about boys and girls (89).

The male-to-female ratio of protagonists in this subset approaches 1.9:1–389 titles with male protagonists versus 206 with female protagonists (99)–an uncanny number that echoes the findings of studies about gender imbalance with other bodies of literature. For example, McCabe et al. (2011) analyzed gender representation in 5,618 children’s books published throughout the 20th century in the United States. Through manual coding, her team of five scholars found a male-to-female ratio of 1.9:1 in title characters, and that of 1.6:1 among central characters. Data also suggested that male protagonists receive more mentions than female ones in the CCL works (101), again a pattern that is consistent with existing scholarship. Underwood et al. (2018) traced 104,000 works of English-language fiction spread over three centuries, from 1703 to 2009. Similarly using BookNLP and relying on HathiTrust Digital Library, they calculated the proportion of words used in describing female characters, and found a steady decline from the 19th century through the early 1960s.

Next, Caballero conducted topic modeling to sort the dataset into 125 clusters, each containing co-occurring words from which a topic or a theme may emerge. Among them, 112 were found to be gendered: 64 topics were more often in stories with male protagonists, and 48 more often in those with female protagonists, suggesting that boys’ topics are 33% more varied than girls’. The rest of the 13 topics were gender-neutral. Caballero presents both macro statistical revelations and in-depth analysis of selected topics.

I thought it would be interesting to pick titles from the Cotsen catalog of the 19th century and test to what extent an individual work reflects large patterns detected by machine. To build up the suspense further, of the five titles I selected from the catalog (largely based on the interest level of illustrations highlighted in the tomes), only one is in the CCL dataset, and four others are not available in HathiTrust, thus not having been “read” by computer programs mathematically.

Topic: Violence/Combat

Word cloud produced by running a statistical natural language processing toolset called MALLET. The topic of this cluster of words is labeled as Violence/Combat. Image courtesy of AnneMarie Caballero.

It should come as no surprise that the topic of violence or combat is found more often in works with male protagonists. One of the Cotsen titles I selected, The Little Deserter; Or, Holiday Sports; An Amusing Tale Dedicated to All Good Boys, epitomizes the strong connection between the topic and an intended boy audience–from the unequivocal dedication to boy readers in its subtitle, to illustrations that portray boys playing soldiers with menacing-looking toys and props.

The Little Deserter; Or, Holiday Sports; An Amusing Tale Dedicated to All Good Boys. Edinburgh: Oliver and Boyd, [1807 or 1808]. (Cotsen 7108)

If you find the scene of execution–in a book published during the Napoleonic Wars–offensively violent for 21st-century sensitivity, you are justifiably feeling so. Here is a spoiler that may be offered as a small solace: Julius, the boy protagonist who has been blindfolded and received the death penalty, bounces back in no time and puts dibs on playing the captain in tomorrow’s game.

Miss Johnston’s name was inscribed on the front pastedown and, as shown here, the front free endpaper of the copy of The Little Deserter. (Cotsen 7108)

What makes the Cotsen copy of The Little Deserter remarkable is that it carries evidence of girls’ expansive reading interests. A “Miss Elizabeth Johnston,” likely a former owner/reader of the book, inscribed her name twice in it. As quoted in Caballero (87), Kimberley Reynolds attributes the appeal of boys’ books for girl readers to the fact that books deemed suitable for young ladies were frequently unexciting tales for cultivating good behavior.

Topic: Island (Stranding)

The topic of this cluster of words is labeled as Island (Stranding). Word cloud courtesy of AnneMarie Caballero.

“Island” is a frequent word found in multiple topics that range from Island (Stranding) to Boats (Stranding, Shipwreck) and Nation (Nationalism), linking to the traditional boys’ adventure story as well as the historical subject of colonial conquest (139-140). Both literary criticism and Caballero’s computational study confirm a gendered landscape in children’s literature, which contrasts the feminine home with the masculine away, and excludes boy characters from the home and girl characters from the away (147). The dichotomy, however, is complicated by what is referred to as “adventurous domesticity” (142), whereby protagonists attempt to reconstruct domesticity while stranded.

“Adventures of Robinson Crusoe” in The Robinson Crusoe Picture Book. George Routledge and Sons, [not after 1873]. (Cotsen 152150)

The masculine pursuit for “home away from home” is well reflected in “Adventures of Robinson Crusoe,” a short, illustrated verse story based on Daniel Defoe’s novel. The titular castaway builds a thatched and fenced house that he can call his little home, makes furniture and clothes (he is pictured as putting finishing touches to an umbrella), keeps company with his dog and cats, and domesticates a young goat and a parrot he has found on the island.

Johnny Headstrong’s Trip to Coney Island. McLoughlin Bros., 1882. (Cotsen 540)

Of the five titles I selected, Johnny Headstrong’s Trip to Coney Island is the only one that is included in the CCL dataset, thanks to digitized copies contributed by member university libraries to HathiTrust. In this verse story, Johnny’s family takes a trip to Coney Island beach. Even though his sister Sue has also joined the outing, she is rarely mentioned. At one point she is described as sitting on the wooden horse of a carousel “like a lady,” i.e., side-saddling. Johnny is the protagonist and remains the center of attention (and chaos) by getting into a nonstop series of scrapes, departing the island with bandages over his nose and cheek at the end of the day. Johnny Headstrong’s adventure seems to be a quintessential bad boy’s tale, having packed into its 20 pages so many of boys’ topics on Caballero’s list (115-6): Movement, Body of Water, Boats, Injury, Donkeys, Animal, to name a few.

“Painful Emotion (Death)” is high on the list of girls’ topics (117, 121). Word cloud courtesy of AnneMarie Caballero.

The pattern of gendered topics does not mean that a boy’s tale is devoid of all topics that are statistically prominent in girl’s stories, and vice versa. Caballero conducts a case study with two of the best-known girls’ adventure stories, Alice’s Adventures in Wonderland and Alice Through the Looking-Glass, and finds a good portion (a quarter and nearly a half respectively) of the top 20 topics in each work are boy’s topics, such as Injury, Water, and Animal (143). Likewise, both Robinson Crusoe and Johnny Headstrong have their emotionally vulnerable moments, described in words that are frequently found in the topic Painful Emotion (Death), which is statistically a girls’ one. A forlorn Robinson Crusoe sometimes grows “very sad,” “cries aloud,” weeps “like any child,” thinks of his father and mother, and prays to God “with many tears” (“Adventures” 1-2). Johnny’s adventure begins as he tumbles overboard, is fished out of the water, and cries as he is sent to the engine-room to dry beside the furnace fire. In one episode, he slips away and loses his Papa and sister Sue, then begins “to cry,” “big tears” running down his chubby face (Johnny Headstrong’s). In another, he accidentally strikes a boy hard with a ball and, thinking the boy would surely die, sobs with “childish fright.” Towards the end he falls off a swing, and adults have to sooth “his sobs and groans.” It is tempting to ask if there might be any correlation between how broadly appealing a children’s story is and how inclusive the work is in encompassing gendered topics.

Girls, Domesticity, and Travel

“Confidential People” in The May Blossom, or, the Princess and Her People. Illustrations by H.H. Emmerson; verses by Marion M. Wingrave. London: Frederick Warne and Co., [1881]. (Cotsen 9380)

Caballero recognized that illustration is an essential element of the Cotsen collection, because of Lloyd Cotsen’s “passion for illustrated works that help children become independent readers” (Immel, quoted in Caballero 54). Her computational analysis handles only texts that have been OCRed, thus the machine has missed about half of the fun of perusing the Cotsen collection! The May Blossom, a collection of short verses, presents an intriguing case of what machine manages not to miss in spite of its singular focus on texts. In one of the entries “Confidential People,” a first-person “I” shares a secret with a second-person “you”–there is no textual description of the setting of the story. In the accompanying illustration, the two characters are seated in an intimate, ornate space, surrounded by objects that well match the most frequent keywords of the topic Domestic Space, one that is found more often in works with female protagonists.

The most frequent words in the cluster for the topic Domestic Space include room, table, chair, and sit (161). Word cloud courtesy of AnneMarie Caballero.

The narrator confides that she plans to marry “a sweet little beau” and to take a honeymoon by “a coach and six horses” to Lilliput Land next year. It is a striking contrast how a story that hints at an exciting trip to the faraway fantasy land is visually represented by two girls confined in a stuffy room, a setting that is mentioned nowhere in the text. Travel (Driving, Carriage) turns out to be one of the gender-neutral topics (114), meaning it is as likely to appear in stories with a male protagonist as a female one. How does it square that domestic space is tied to girls’ stories, yet travel is not ? In “Confidential People,” the girl’s narrative about travel is firmly grounded within approved gender roles. The endearingly amusing verse both adores the young narrator’s childish innocence and models an aspiration for marriage that leads to the fulfillment of traditional womanhood.

“Johnny’s First Motor Ride” in Little Tots Holiday Book: With Numerous Coloured Plates and Other Illustrations. London; New York : Frederick Warne & Co. (Cotsen 30357)

A close reading of another story that fits the topic of Travel (Driving, Carriage) invites us to consider what it means to be the central character of a story, and circles back to gender imbalance in terms of the count of female versus male protagonists as well as the proportion of words devoted to each gender. In “Johnny’s First Motor Ride,” the titular character receives a real little motor-car from his father and soon learns how to “control it with ease.” With a bonneted baby deposited in the passenger seat–possibly against the baby’s will, judging from his/her facial expression–he goes out for a ride[1]. After trying to abruptly avert a collision with Margery’s goat-chaise, however, Johnny finds his car stuck. It is at this point, where the story has run two-thirds of the way towards the end, that attention swerves to Margery. Described by her father as “a real clever little woman,” Margery is sympathetic, helpful, and resourceful. Even though it is not her fault that Johnny’s car malfunctioned, she does not abandon the stranded novice motorist. She sets to work “to harness the damaged motor-car to the goat-chaise,” which is pulled by “Nanny” the goat, and coaxed the hoofed “engine” to tow the modern vehicle home. “That was a real triumph for Nanny!”–the story concludes with the exclamation.

Whether by its title “Johnny’s First Motor Ride” or by the amount of text devoted to Johnny, the protagonist of the story is apparently a boy–to machine’s mathematical “mind” at least. I can’t predict how a human reader interprets who is the central character of the story. Margery clearly shines with what she has done, even though she doesn’t receive the most mention in the story. That the credit for the successful rescue act should go to the goat implicitly imparts a self-effacing virtue expected from females. The girl character is sidelined even in a story where she is not the damsel in distress but the heroine who saves the day.

Caballero’s computational analysis of a sizeable body of 19th-century English language children’s literature reveals a gendered landscape, tethering female characters to the domesticity and the inward, freeing male characters to the wider world away from home, enlarging the gap between endorsed masculine and feminine behavior, and bundling implicit morals and values for each gender. She brings rich complexity into her project by tracing how a large-scale analysis of over a thousand works agrees with or departs from findings based on traditional literary criticism of a limited number of canons. It is a testament to the robustness of her study that, for the five titles from the Cotsen collection–only one of which available in the dataset–the patterns still hold true and help us gain fresh insights into these dusty volumes.

Note

[1] Johnny and his father break all the modern government regulations for driving an automobile. There is no publication date on the book. Let’s assume the story was written soon after the invention of the first automobile in 1886, before the driver’s license began to be implemented by the end of the 19th century, or before the age restriction was first introduced in Pennsylvania in 1909.

References

Caballero, AnneMarie. Gendered Topics: Boyhood and Girlhood in a Century of (Cotsen) Children’s Literature, Princeton University, 2023.

Mccabe, Janice, et al. “Gender in Twentieth-Century Children’s Books: Patterns of Disparity in Titles and Central Characters.” Gender & Society, vol. 25, no. 2, 2011, pp. 197-226.

Underwood, Ted, David Bamman, and Sabrina Lee. “The Transformation of Gender in English-Language Fiction.” Journal of Cultural Analytics, vol. 3, no. 2, 2018.

Resources

Datasets Curated by AnneMarie Caballero (the exact scope of each dataset is detailed on page 65 of her thesis):

The Cotsen Children’s Literature (CCL) Dataset (1021 items as of July 2023) [URL]

  • The subset of titles as found in A Catalogue of the Cotsen Children’s Library: The Nineteenth Century (416 items) [URL]
  • The subset of titles as found outside A Catalogue of the Cotsen Children’s Library: The Nineteenth Century (605 items) [URL]

Titles as found in Cotsen’s catalog but excluded from the CCL Dataset (123 items) [URL]

GitHub repository of the cleaning script for the CCL Dataset [URL]

(Edited by Andrea Immel)

Cotsen Research Report: Feodor Rojankovsky’s Alphabet Books Part II

This winter Research Grant winner JoAnn Conrad spent a month in Rare Books and Special Collections reading room poring over dozens and dozens of early twentieth-century books in Cotsen and in Graphic Arts.  Her project, “Women’s Work: Magazines, Advertising and Children’s Books in the ‘Golden Age’ of Illustration in the United States,” is one piece of a more ambitious one about the role of early twentieth-century artists who worked across media in graphic design, advertising, and children’s book illustration played in disseminating the visual language of modernism.  Joann’s survey of Feodor Rojankovsky’s  ABCs will be in two parts: today’s second post discusses work he created in the United States.

The A Б Cs of Feodor Rojankovsky

By JoAnn Conrad

In 1925, Vassar graduate Esther Averill moved to Paris, where many young, college-educated American women were seeking to understand their place as writers, artists and as women in a more socially liberating scene. In 1930 Averill and business partner Lila Stanley had a new venture with an American client looking to modernize his line of stationary. Averill turned to Rojan whose drawings enchanted them: “They had color, gaiety and humor, and revealed how thoroughly he understood the graphic possibility of the medium.”[1] The stationary endeavor fell through, but Rojankovksy, eager to enter into the new market in children’s picturebooks proposed they collaborate. The result was the 1931 Daniel Boone,[2] a fauvist-inspired series of brightly colored lithographs celebrating the American hero as he fought Indians and settled the wilderness — a fantasy of the American West shaped, in part, on Rojankovksy’s own childhood fascination with the works of Fenimore Cooper. But Rojankovsky’s experience in modern advertising layout and illustration was also major influence on the book, especially his use of brightly colored graphics to create a feeling of a fantastic West above and beyond the text. The plates were visually stunning but prohibitive to print, and Averill could find no American publisher willing to take it on. She and Stanley therefore established their own Domino Press, which would specialize in children’s picture books illustrated by gifted young artists and reproduced by means of the excellent color processes that were available in Paris,”[3] and published Daniel Boone themselves. Rojankovsky would go on to illustrate four more Domino books, published in French and English versions and distributed in France and the U.S.,  preceding Rojankovsky’s own immigration by almost a decade.

Illustration from Daniel Boone. (Paris: Domino Press, 1931) Cotsen 16819,

In 1933, on the basis of this impressive start in children’s book illustration and advertising work, Rojan was hired by visionary French children’s book innovator Paul Faucher, creator of Les Albums du Père Castor. Faucher’s radical pedagogy was based on the concept of “hands on,” participatory learning that sought to elicit children’s creative responses to the real world. Faucher’s first team of illustrators consisted primarily of Russian expatriate artists — Nathalie Parain, Alexandra Exter, Hélène Guertik, and Rojankovsky, all of whom had adopted the forms and techniques of the avant-garde, constructivism, and Cubo-Futurism which had been their common experience in the Russian art milieu in the early 1900s. Using elemental shapes, and primary colors to evoke a synesthetic, multi-sensory experience, and deploying non-representational and minimalist images, they sought to establish an unmediated, visceral, and emotional engagement with the viewer. In Paris in the 1930s, they redirected this towards Faucher’s new vision for a modern, interactive picturebook. Decoupled from any Socialist message, these artists, in various ways, extended the modernist experiment into his new pedagogical and commercial enterprise; their attempts at viewer engagement aligned with Faucher’s participatory pedagogy.

Rojankovsky often illustrated for Faucher’s wife, the author Lida who shared her husband’s philosophy, and with her he produced a series of books with naturalistic images of animals in their habitats. But in 1936 Rojankovsky collaborated with Faucher himself on an ABC book/game,[4] the cover of which harkens back to the frontispiece of Zhivaia Azbuka, with animals spilling out from their containment. But whereas in the 1936 book the animals featured inside are consistent with those on the cover, in Zhivaia Azbuka the cover and frontispiece feature wild, exotic animals in contrast to the more ordinary and familiar animals accompanying Chernyi’s verse. Faucher’s ABC is also a game, in this case several bingo-type games of increasing difficulty. Thus, while the ideological thrust of Zhivaia Azbuka had been conservative and preservationist, Faucher’s is clearly pedagogical. Zhivaia Azbuka’s intended audience was Russian émigré children, who most likely spoke Russian at home and in the community, but whose language of instruction in their new home was French. That is, the goal of the book was to expose children who likely knew how to read in French, but who also spoke Russian, to the Cyrillic alphabet and thereby to written Russian; a context which better explains Chernyi’s complex couplets. This is not a beginning reader, or a “Baby’s first ABCs.” In contrast, Faucher’s ABC specifically links letters to sounds, and illustrates these sounds through the initial phonemes of individual illustrations of animals. There are no vignettes, either visual or verbal. Faucher’s is a classic ABC book, with one large, capital letter, its corresponding lower-case cursive letter, and a representative animal on a blank background. The presumptive learning process necessitates a prior knowledge of the represented animal from which the relationship between the initial sound and the letter can be suggested. This process breaks down when the animals are exotic and perhaps unknown to the child.

Front cover of Paul Faucher’s ABC / jeux du Père Castor. ([Paris]: Flammarion, 1936). Cotsen 21253

Frontispiece to Zhivai︠a︡  Azbuka. (Paris: N. P. Karbasnikov, 1926) Cotsen 9209.

Tearable cards from ABC / jeux du Père Castor.  Cotsen 21253

back free endpaper game board from ABC / jeux du Père Castor.  Cotsen 21253

Word-Letter associations that work in Faucher’s ABC. Cotsen 21253

Word-Letter associations that don’t work in Faucher’s ABC. Cotsen 21253

ABC books such as Faucher’s, with their one-to-one correspondence of letters and sounds illustrated by images of single words were extremely popular in the late 1800s and early 1920s,[5] and reflect an early pedagogical approach to reading, one which would be displaced by phonics by mid-century. In 1936, the popularity of this type of ABC book was on the wane, but Rojankovsky, long after settling in the U.S. would return to this format one last time with his Animals in the Zoo, in 1962.[6] The page layout is reminiscent of Faucher’s book from over 25 years earlier, but what is even more notable is the cover, which features a giraffe in an old-fashioned zoo enclosure, whose ‘spots’ are made out of letters. Rojankovsky would again use this technique in his final ABC book in 1970.[7] Interestingly, this technique of constructing an animal’s body out of letters in an ABC book was used in a 1917 ABC book by Sybil Rebman, published by the Chicago-based P.F. Volland Company. But rather than just filling the animal’s outline with random letters, in Rebman’s book each animal associated with a letter in the alphabet is constructed out of letters which also spell out the word for that animal. Although neither Rebman nor Volland have the name recognition of Rojankovsky, in 1917 her illustrations and concept are artistically innovative (albeit pedagogically dubious), whereas in 1962 Rojankovksy’s repetitive wild-life images and simplistic format look dated. Nonetheless, Rebman’s use of letters is conventional in that they spell out a word, while Rojankovsky’s use of letters decouples them from any corresponding sound or meaning.

Animals in the Zoo (New York: A.A. Knopf, 1962).

Inside pages of Animals in the Zoo. 

ABC: An Alphabet of Many Things. (New York: Golden Press, 1970). Cotsen 152493

Sybil Rebman’s Animal Alphabet. (Chicago.: P.F. Volland Company, 1917). Cotsen 21871

In addition to the English editions of his books for Domino Press, by 1936 many of Rojankovsky’s books in Les Albums du Père Castor were translated into English and distributed in the U.S., brokered by the director of the Artists and Writers Guild, Georges Duplaix. Thus well in advance of his arrival in 1941, his work was well received. Duplaix would be a critical player in Rojankovsky’s life and work, facilitating safe passage out of southern France ahead of Hitler’s army in exchange for a long-term exclusive contract. When Duplaix launched his new commercial venture The Little Golden Books series in 1942, Rojankovsky joined a stable of other émigré artists, including Gustaf Tenggren and Tibor Gergely, to produce some of the most widely published children’s books of the 20th century. Of the over 35 books Rojanksovsky illustrated for the Golden Books from 1943 to 1970, there are none that fit the classic ABC book format. Instead, Rojankovsky’s work focuses mostly on animal stories. His second Golden Book – Animal Stories, written by Duplaix,[8] links animal stories to the alphabet in what seems to have been Rojankovsky’s main interest in ABCs all along– experiments in typography.  In the heart of the book, in between fanciful tales of animals and a display of Rojankovsky’s illustration styles, there are 6 pages of illustrated block letters, each accompanied by a small verse  (21-26). These were later issued as a set of nesting blocks.[9] Even more interestingly, these letters were used in a series of magazine advertisements for Puss n’ Boots cat food during the 1940s.[10]

Georges Duplaix and Feodor Rojankovsky. Animal Stories. (New York: Simon and Schuster, 1944). Cotsen 7370

Playschool Golden Book Nested Blocks. Golden Press, Inc. Licensed by Playschool, 1958.

Rojankovsky illustrated “Puss n’ Boots” cat food advertisement, Saturday Evening Post, [1949]

Golden Books had not abandoned the ABC-book concept or format, but in the late 1940s and early 1950s, this task had already been taken over by a new group of illustrators – Richard Scarry, Garth Williams, and Cornelius DeWitt. The format also changed: No longer single letters with a single representative figure on a white background, the emphasis had switched to filling the pages with as many representative images as possible. Rojankovsky replicated this in his final book, which fittingly was an ABC book — Alphabet of Many Things (1970), but the end papers again reveal his fascination with typography.

Page spread from ABC: An Alphabet of Many Things. Cotsen 152493

Endpapers from ABC: An Alphabet of Many Things. Cotsen 152493

Rojankovsky’s use of foliage to construct letters of the alphabet appears in many titles to many of his books, which are otherwise unrelated to Alphabet books: Animals on the Farm (Knopf, 1967), and Frog Went A-Courtin (1955), to cite two, and the 1950s-style ABC book format is replicated in the 1960 Animal Dictionary for the Little Golden Books,[11] but, with the exception of Frog Went A-Courtin these are rather joyless, perfunctory endeavors, displaying none of the wit and innovation of Rojankovsky’s other work. Rojankovsky was working under exploitative terms both for Paul Faucher and for Duplaix and the Golden Books. His compensation was low, and the demands of such large-scale publishing operations limited his artistic freedom. Much of his later work seems to be re-assemblages of previous pieces, and one might wonder whether his “recognizable style” is more a function of images he repurposed for his own self-preservation, particularly when his entire oeuvre is examined. A revisitation of his work through the lens of ABC-books reveals its range and variety even when working with the constraints of publishing houses, during political turmoil, and across continents.

Watson, Jane W, and Feodor Rojankovsky. Animal Dictionary. New York: Golden Press, 1960.

[1] Allen, Irving, Polly Allen, and Koly T. Rojankovsky. Feodor Rojankovsky: The Children’s Books and Other Illustration Art, 2014. Kindle edition, location 893.

[2] Averill, Esther Holden, Lila Stanley, and Feodor Rojankovsky. Daniel Boone: Historic Adventures of an American Hunter Among the Indians. Paris: Domino Press, 1931.

[3] Esther Averill Biography from the “Jenny Linsky and the Cat Club” page: http://www.93950.com/cat_club/bio.htm

[4] Faucher, Paul, and Feodor Rojankovsky. ABC / jeux du Père Castor ; dessins de F. Rojan.

[5] The Cotsen Children’s Library at Princeton has a digital archive of historical ABC books with many examples https://etc.princeton.edu/abcbooks/. Two of the most prolific publishers during the turn of the century period were Raphael Tuck (London), and the McLoughlin Brothers (New York).

[6] Rojankovsky, Feodor. Animals in the Zoo. New York: A.A. Knopf, 1962.

[7] Rojankovsky, Feodor. ABC: An Alphabet of Many Things. Golden Press, 1970.

[8] Duplaix, Georges, and Feodor Rojankovsky. Animal Stories. New York: Simon and Schuster, 1944.

[9] Playschool Golden Book Nested Blocks. Golden Press, Inc. Licensed by Playschool, 1958.

[10] Coast Fishing Company, Wilmington, Ca. The one-page ads with a featured animal, accompanied by a letter of the alphabet inscribed by Rojankovsky, ran in the Saturday Evening Post from 1947 to 1948. These were compiled into a promotional book in 1949.

[11] Watson, Jane W, and Feodor Rojankovsky. Animal Dictionary. Racine, Wis: Golden Press, 1960.