Post45 Data Collective
  • Data
  • Tools
  • Style Guide
  • Submissions
  • About
    • About
    • People
    • How We Made This
  • News

Table of Contents

  • Data Table
  • Significance & Context
  • Collection and Creation
  • Dataset Description
  • Ethical Considerations
  • Bibliography
  • Resources used to make the dataset:
  • Licensing

Small Press Distribution Bestseller Lists 2006-2023

fiction
poetry
nonfiction
bestsellers
publishing
dataset
draft
Authors

Sera-Ann Hargrove

Micah Bateman

Published

June 12, 2026

Doi

10.18737/857371

Abstract
This dataset documents the bestselling authors and titles distributed by Small Press Distribution (SPD) between 2006-2023, including American poetry, short and experimental fiction, and hybrid nonfiction and essay.

Data Table

TipCreative Commons License

This work is licensed under CC BY 4.0

Significance & Context

This dataset bears out narratives of the emergence, reach, and sometimes decline of many small presses, authors, and titles that have often become central figures in American poetry, short and experimental fiction, and hybrid nonfiction and essay. Independent publishing houses used Small Press Distribution (SPD) to distribute their publications to booksellers from 1969 until March 2024, when the company abruptly shuttered. More than a logistical middleman, SPD functioned as a central piece of cultural infrastructure for U.S. literary small presses. For many small publishers, SPD meant the difference between visibility and obscurity, providing the primary means of circulation and distribution, bookstore access, and institutional legitimacy. SPD aided presses producing often groundbreaking, prize-winning, and experimental poetry, fiction, and nonfiction. In May of 2006, SPD began posting monthly features highlighting their bestselling titles in fiction, poetry, and nonfiction to their website, with lists from the 2010s and 2020s featured and still accessible on their now inactive Instagram and Facebook accounts and lists from before 2010 available through the Internet Archive’s Wayback Machine. These bestseller lists index a distinctive economy of literary value shaped by nonprofit distribution, independent bookstores, aesthetic formation, curricular adoption, and long-tail backlist circulation. Spanning the rise of social media–mediated literary publicity and the decline of brick-and-mortar distribution infrastructure, the dataset captures a crucial, transitional period in U.S. small-press literary culture. With SPD now defunct and its web presence inactive, the need to preserve information about SPD’s operations is urgent. Collecting and preserving SPD’s bestselling titles can support the searchability of these works, the authors who created them, and the independent publishers who relied on SPD to bring these books to market.

The data in this project tells the story of emergent authors and presses, new prize winners, forgotten independent presses, and frequent bestsellers. One illustrative case is Renee Gladman, who appears as a new author at the start of SPD’s bestseller lists and becomes the most frequently appearing fiction author, with 41 appearances. Gladman’s first book, Juice, appeared in the August 2006 Poetry Bestseller List, with following lists marking nine more books throughout Gladman’s career. The most frequent fiction book to appear in this data is Garielle Lutz’s first book, Stories in the Worst Way, with fifteen appearances. Originally published in 1996, Lutz’s debut appears on the bestseller lists from 2006 to 2011 with new editions from 3rd Bed and Calamari Press. The most frequently mentioned fiction publisher is Dorothy, a publishing project begun in 2009, appearing a total of 121 times with 22 titles.

As for nonfiction, the most frequently mentioned book is Plants and Landscapes for Summer Dry Climates of the San Francisco Bay Region by East Bay M.U.D. Water Conservation Staff. Three authors, Lyn Hejinian, Ron Silliman, and Rae Armantrout, were tied for the most frequent nonfiction author appearances on the list, with a total of 15 entries each (13 of which came from The Grand Piano parts 1-10). The top nonfiction publisher by number of appearances is Essay Press, appearing 40 times. For poetry, the most listed book is Tao Lin’s debut, You Are a Little Bit Happier Than I Am, published by Action Books in 2006 and appearing on the bestseller lists 21 times until 2010. The poetry publishing press appearing the most on the lists is Flood Editions with a total mention count of 164. The most frequent poet in the data is Sherman Alexie with a total of 48 appearances across seven books. (Counts reflect appearances as recorded in SPD’s genre-specific lists, even where individual titles appear across multiple genres.)

As SPD noted in its publicity, many of these titles and authors have won literary awards. Some of these winners have appeared frequently and not so frequently on SPD’s bestseller lists. One prize winner that SPD names on their website is National Book Award in Poetry winner, Punks: New & Selected Poems by John Keene, which appears on the bestseller lists sixteen times. However, Pulitzer Prize in Poetry winner, Olio by Tyehimba Jess, only appears once on the bestseller lists. With this in mind, the frequency of repeated titles in this data does not always correlate to critical acclaim. Repeated appearances often indicate sustained backlist circulation rather than moments of sudden popularity, offering a view of how small-press titles remain economically viable over long periods through course adoption, independent bookstore loyalty, and niche readerships. These lists also offer insight into the lifespans, phases, and vulnerabilities of small presses. The second most frequent poetry publishing press in this dataset, Ahsahta Press, closed down at the end of 2019, while the Song Cave, a press that began appearing on bestseller lists in the early 2010s, dominated the lists by the end of SPD’s run. Taken together, these patterns enable press-level analysis of longevity, aesthetic coherence, and institutional durability within the U.S. small-press ecosystem.

Collection and Creation

To create this dataset, we documented the SPD bestsellers posts from Instagram (June 2021-December 2023) through screen captures, transferred to text, and compiled. From there, the title, author, publisher, ranking, genre, date, and Instagram post link were used as starting points for the dataset. Using the Library of Congress Catalog and WorldCat, the entries were checked for accuracy, and fields were added for earliest publication date and publication location. Lists from May 2006 to May 2021 were accessible through the Internet Archive’s Wayback Machine’s captures of the Small Press Distribution website. Using these lists, a full document was made of the combined total of lists from Instagram and the Wayback Machine, creating a total of 7,820 rows in the final dataset.

The authors then extracted the data from the master list of bestseller lists and put it into Google Sheets. They used REGEXTRACT to separate the title, author(s), and publishers, and put them into their own columns, and CONCATENATE to format the author names as “last name, first name.” Fields were then added for ISBN number, OCLC number, and author VIAF permalink and ID number. The authors used WorldCat to ensure correct titles, authors, publisher, and publication date, as well as to pull the ISBN and OCLC numbers. Critical researcher judgment was used to identify the edition, with consideration of the date of the bestseller list and the specific publisher, with earliest publication date selected in most cases. Using the Virtual International Authority File (VIAF), author names were checked and changed to the preferred name listed for the author according to the Library of Congress/NACO (Name Authority Cooperative Program). Metadata were manually added to the additional fields with additional help from the Post45 Collective’s BookReconciler Service on OpenRefine. Reconciling the title and author fields through the BookReconciler Service, ISBN numbers, OCLC numbers, and VIAF links were extracted from Library of Congress, Google Books, WorldCat, and VIAF. Any missing metadata that could not be reconciled through OpenRefine were finally manually added. Where inconsistencies in naming or classification appeared, metadata fields were standardized while preserving the structure and genre designations of the original lists.

This dataset includes the bestseller lists published on the Small Press Distribution website from the years 2006 to 2021 and the SPD Instagram account from the year 2021 until their close in 2024. Excluded from this dataset are the annual bestseller lists SPD posted in 2006 for poetry and fiction, the 2000-2009 poetry bestsellers list, and the top 100 poetry books of the 2010s list.

In some cases, an ISBN number, OCLC number, and VIAF link were unavailable for listings. This was due to the format and publication of some items; for example, the listing, “Tomas Tranströmer Broadside of His Poem ‘Vermeer,’” does not have an ISBN and OCLC number because it is a broadside. As for missing VIAF links, some authors were not listed when searching.

Dataset Description

The first six fields (ranking, title, author, publisher, date, genre) in the dataset are taken directly from the bestseller lists as posted to SPD’s Instagram and website, with edits made by the researchers to account for accuracy and searchability. SPD released their bestseller lists by genre each month for poetry and every other month for fiction and nonfiction, so there were individually generic lists each for poetry titles, fiction, and nonfiction. An “appearance” is defined as a single instance of a title on a monthly or bimonthly bestseller list. Counts therefore reflect repeated list inclusion rather than distinct works, and multi-author or serial works (such as The Grand Piano) contribute to each listed author’s total. This approach preserves the structure of SPD’s reporting while enabling comparison across titles, authors, and presses.

  • ranking: the original ranking of the title as it was listed on the bestseller list
  • title: the name of the book
  • author: the name of the author(s), editor(s), artist(s), and/or translator(s) of the given text formatted as last name, first name, with specifications for editor, artist, or translator in parentheses
  • publisher: the name of the publishing press who published the book
  • publisher_variants: alternate names the publishing presses use or variant of the name listed on the original bestseller list
  • date: the month and year of the bestseller list the book is listed on
  • genre: which bestseller list the book appears on (fiction, poetry, or nonfiction); also the genre of the book
  • publication_date: earliest found year the book was published as indicated by the Library of Congress Catalog and WorldCat
  • publication_location: geographic location of the publishing press, found through the Library of Congress Catalog, WorldCat, and when needed, publisher websites or CLMP
  • marc_country_code: Two-letter country code and three-letter state code as listed from the MARC Code List for Countries
  • isbn_number: ISBN number of the title taken from WorldCat
  • oclc_number: OCLC number of the title taken from WorldCat
  • viaf_permalink: permalink to the VIAF listing of the author
  • viaf_id: ID number for the author’s VIAF listing
  • source_link: a link to the originating Instagram post or Wayback Machine link to the SPD website
  • unique_id: ID number given to each entry by the researchers following the template “SPD0000”

Ethical Considerations

Because SPD’s distribution network reflected particular aesthetic, institutional, and geographic concentrations within U.S. literary culture, this dataset should not be understood as a comprehensive map of all small-press activity during this period. Rather, it documents the circulation patterns of presses and authors operating within SPD’s specific infrastructural and curatorial ambit. Additionally, there is not much information about SPD’s own methodology for devising bestseller lists, so the rankings contained in this dataset may not reflect actual sales or readership numbers for each given month as listed. This dataset serves as a recording of the lists that were posted by SPD as they were. SPD’s bestseller lists were published as is, with no other information indicating sales figures, copies sold, etc. In SPD’s “Top 100 Poetry Books of the 2010s” list, they indicate that the list is a result of the total SPD sales from January 2010 to August 2013. This may or may not be the same methodology they used for creating their bestseller lists.

There was also not much information given on the exact edition for some of the titles, so the publication date and ISBN numbers reflect what was found in WorldCat with researcher judgment based on the time the bestseller list was published. Multiple editions of the same book are even included in the dataset due to the wide date range of publications. For example, Borderlands/La Frontera: The New Mestiza by Gloria Anzaldúa published by Aunt Lute Books is on the list a total of 21 times spanning from 2007 to 2023 and from both nonfiction and poetry bestseller lists, representing four separate editions accounted for by researcher judgment. In most cases, however, earliest publication date is used.

Bibliography

About the Closure of Small Press Distribution: FAQ. (n.d.). Community of Literary Magazines
and Presses. Retrieved May 15, 2025, from https://www.clmp.org/about-the-closure-of-small-press-distribution-faqs/

Bourne, M. (2024, June 12). The End of Small Press Distribution. Poets & Writers.
https://www.pw.org/content/the_end_of_small_press_distribution

Morgan, A. (2024, April 3). “The Small Press World is About to Fall Apart.” On the Collapse of
Small Press Distribution. Literary Hub. https://lithub.com/the-small-press-world-is-about-to-fall-apart-on-the-collapse-of-small-press-distribution/

Presses Previously Distributed by SPD. (2024, March 29). Community of Literary Magazines
and Presses. https://www.clmp.org/news/presses-previously-distributed-by-spd/

Resources used to make the dataset:

Small Press Distribution (@spdbooks) • Instagram photos and videos. (n.d.). Retrieved May 15,
2025, from https://www.instagram.com/spdbooks/

LC Catalog—Browse. (n.d.). Retrieved May 15, 2025, from
https://catalog.loc.gov/vwebv/searchBrowse

Wayback Machine. (n.d.). Retrieved May 15, 2025, from https://web.archive.org/

WorldCat.org. (n.d.). Retrieved May 15, 2025, from https://search.worldcat.org

Licensing

https://creativecommons.org/licenses/by/4.0/

Authors should be listed with Hargrove, Sera-Ann first and Bateman, Micah second.

Citation

BibTeX citation:
@article{hargrove2026,
  author = {Hargrove, Sera-Ann and Bateman, Micah},
  editor = {Manshel, Alexander and Porter, J.D. and Walsh, Melanie},
  title = {Small {Press} {Distribution} {Bestseller} {Lists} 2006-2023},
  journal = {Post45 Data Collective},
  date = {2026-06-12},
  url = {https://data.post45.org/posts/small-press-distribution-bestsellers},
  doi = {10.18737/857371},
  langid = {en},
  abstract = {This dataset documents the bestselling authors and titles
    distributed by Small Press Distribution (SPD) between 2006-2023,
    including American poetry, short and experimental fiction, and
    hybrid nonfiction and essay.}
}
For attribution, please cite this work as:
Hargrove, Sera-Ann, and Micah Bateman. 2026. “Small Press Distribution Bestseller Lists 2006-2023.” Post45 Data Collective, accepted, June 12. https://doi.org/10.18737/857371.
 

Supported by National Endowment for the Humanities Emory Center for Digital Scholarship Built with Quarto