EDIT: Hey folks! Thanks for the feedback on our previous post. Our vague wording may have misled people as to our intentions, for which we apologize. We are currently taking a step back to rework how best to communicate our intentions in a way that is the most sensitive to the needs of the community. Thank you for your engagement!
We have closed the survey link for now, but here's the survey description for future reference:
What is it about our human-human interactions that makes creatively writing together so compelling and unique? What about that is completely lost in human-AI writing interactions? Do you think ChatGPT is bad at creative writing? Do you have big feelings about any of the above topics? Then keep reading!
We are conducting this study because we are interested in analyzing collaborative creativity methods among fanfiction writers. The goal is to criticize the current state of AI-assisted creative writing and offer suggestions from seasoned creative writers on how it could be improved and designed to actually help the people it affects.
First off: what does human-centered mean? The goal of human-centered research is to design technologies based off of HUMAN interactions, and these technologies should be for HELPING humans without replacing, displacing, or marginalizing them.Â
If you've tried interacting with AI tools like ChatGPT...you might notice they're unhelpful, and even outright bad, when it comes to writing creatively. The goal of this research is to find out: Do people even want them to be helpful? CAN they be helpful in any way? Is it impossible for AI to produce creative writing that can hold a candle to anything a human could write? Why might it be impossible?
Thereâs a lot of research being done in this area that is not very human-centered - it involves making AI tools for creative writing and then asking people how they feel about them, instead of the reverse. We believe that a better approach would be to ask people how they feel about AI tools and whether or not they can be helpful, and propose design guidelines based on that.
We believe that this is particularly relevant to fanfiction authors: due to how AI tools are trained, a large proportion of the dataset for AI-based creative writing is likely comprised of fanfiction, due to how much of it there is on the internet.Â
Weâre looking for fanfiction authors aged 18 and above who co-write fanfiction with one or more collaborative partner(s). This can be short-form (co-writing one-offs, single chapters) or long form (co-writing entire fics, long-term collaborations) - weâre essentially interested in the methods that you and your collaborators use together to produce works of creative fiction.
The provided survey will take approximately 15-20 minutes to complete. If youâre interested in telling us more, you can sign up for a 30-45 minute interview at the end of the survey. Ideally, you and your writing collaborator(s) would be able to attend this interview together. Every interview participant will be compensated with a $10 gift card.
All parts of this survey were approved by the University of Washington Human Subjects Division Institutional Review Board (IRB) to ensure the protection of your rights and welfare as you take this survey. Your responses will be kept confidential, although we may publish aggregated results. You may exit the survey at any time.
If you have questions, comments, or concerns, reach out to [email protected].Â
Hello! We are researchers at the University of Washington Human-Centered Data Science Lab, and we are studying modes and methods of collaborative creating writing. Weâd love to have you participate in our study!
Survey link: [now closed]
Weâre looking for fanfiction authors aged 18 and above who co-write fanfiction with one or more collaborative partner(s). This can be short-form (co-writing one-offs, single chapters) or long form (co-writing entire fics, long-term collaborations) - weâre essentially interested in the methods that you and your collaborators use together to produce works of creative fiction. The eventual goal of this work is to suggest more human-centered guidelines for AI-based creative writing tools.
The provided survey will take approximately 15-20 minutes to complete. If youâre interested in telling us more, you can sign up for a 30-45 minute interview at the end of the survey. Ideally, you and your writing collaborator(s) would be able to attend this interview together. Every interview participant will be compensated with a $10 gift card.
All parts of this survey were approved by the University of Washington Human Subjects Division Institutional Review Board (IRB) to ensure the protection of your rights and welfare as you take this survey. Your responses will be kept confidential, although we may publish aggregated results. You may exit the survey at any time.
For questions about our research, contact Nisha Devasia at [email protected]
Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.
â Live Streamingâ Interactive Chatâ Private Showsâ HD Quality
Anya is LIVE right now
FREE
Free to watch ⢠No registration required ⢠HD streaming
For those of you who donât know, Iâm a doctoral student doing research on fandom.
Iâm currently running a demographics survey about fanartists: who we are, the kinds of art we make, and what kind of backgrounds we come from. Â Iâm hoping to hear from anybody who considers themselves a fanartist and has created fanart in the past 5-10 years.
The survey should take less than ten minutes to do, and your information will be anonymized and kept confidential.
The survey is here: https://forms.gle/3TytDHrjpnLMUMKi8
Please reblog and share it with other fanartist friends!
Thank you so much!
Update #1:
Thank you all so much for the responses and for the time and care that you show to this community! Iâve added links to the Instagram and Twitter versions of this post, if youâre interested in sharing on other platforms.
Update #2:
Somebody reminded me not to put the Image ID underneath the cut!
[Image ID: A monochromatic square, purple image with a patterned border. The title text centered at the top of the image reads âFanartist Demographic Surveyâ in white letters with dark purple outlines. Â The middle left features a cartoony bug-like creature, drawn in matching monochromatic purples, looking down at a clipboard in their hands. The middle right side of the image displays a text bubble coming from the bug-like character stating: âAre you a fanartist? Want to talk about it for science? Take the survey!â Below the text bubble in the lower right corner is a black-and-white QR code that links to the aforementioned survey.]
Hello! We are researchers at the University of Washington Human-Centered Data Science Lab, and we are studying modes of and preferences towards content recommendation. Â Weâd love to have you participate in our study! Â
Weâre looking for fanfiction readers and authors aged 18 and above. Weâre interested in hearing from anyone who actively engages in AO3's folksonomic recommendation system (using things like Bookmarks or recommending content to other friends etc.) and also uses platforms like TikTok, Instagram, Twitter etc. This screening survey will take approximately 5 minutes to fill, and if youâre eligible, we will contact you for a 1-hour interview.
All parts of this survey were approved by the University of Washington Human Subjects Division Institutional Review Board (IRB) to ensure the protection of your rights and welfare as you take this survey. Your responses will be kept confidential, although we may publish aggregated results. You may exit the survey at any time.
For questions about our research, contact Sourojit Ghosh at [email protected]
hi! what would you say about the stigma surrounding fanfiction in college settings? iâm applying to undergrad colleges for a stem subject (uw is one of them!) and i want to mention fanfiction in some of my supplements since itâs a huge part of my life at this point, but iâm really on the fence about it, considering the ideas people have about it, and how admissions officers could take it, so if you have any advice there? (your research is really cool!)
Hello!! My experience in academic settings is that while there is less stigma surrounding fanfiction today than in the past, mentioning it's a huge part of my research area can be polarizing. Meaning, some people will almost immediately be excited and have strong positive reactions. Others will be negative and invalidating. At either extreme, whatever I say about my research doesn't change their opinion much. However, a good amount of people I run into don't know anything about fanfiction at all, are surprised to find out it's this massive community, and are genuinely curious about my research. My advice would be to write about fanfiction in your admissions essay with that audience in mind and try win them over. It's worth being vulnerable to an extent in order to find those people who will be in your corner.
Hello! We are researchers at the University of Washington Human-Centered Data Science Lab, studying questions of monetary compensation for fanfiction authors and readers. Weâd love to have you participate in our survey!
Fanfiction Survey Link
Weâre looking for fanfiction readers and authors aged 18 and above. Weâre interested in hearing from anyone who has either received compensation for content they have authored, or paid fanfiction authors for their work. This screening survey will take approximately 5 minutes to fill, and if youâre eligible, we will contact you for a 1-hour interview. You will be compensated $20 via online gift card.
All parts of this survey were approved by the University of Washington Human Subjects Division Institutional Review Board (IRB) to ensure the protection of your rights and welfare as you take this survey. Your responses will be kept confidential, although we may publish aggregated results. You may exit the survey at any time.
For questions about our research, contact Sourojit Ghosh at [email protected]
Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.
â Live Streamingâ Interactive Chatâ Private Showsâ HD Quality
Anya is LIVE right now
FREE
Free to watch ⢠No registration required ⢠HD streaming
How I Tracked Down a Peculiar Problem in a Fanfic Dataset Using Visualizations
Every review on fanfiction.net has an associated timestamp telling us exactly when it was posted, or so we thought. When trying to find the hours of peak review activity across different fanfiction.net fandoms, I saw some fandoms with very uneven review count distributions (shown below).Â
What made this even more confusing was that some of the fandoms had much more pronounced spikes at 7 and 8 UTC. I compared the fandoms with large spikes to those without. I noticed that ones with spikes tended to be fandoms with many reviews in the early 2000s. I wanted to look at how the distribution of review time changed over each year. I chose to make a heat map of the average daily distribution by year. I did some data wrangling so that I could put year on the Y axis and hour of review on the X- axis. Below is the result.
First Exploratory Visualization:
The resulting visualization made the situation much clearer. Every year should look like the ones between 2012 and 2017, where reviews are relatively evenly spread across the day with small variations at peak hours. Before 2012 we see very different behavior. Around 60% of reviews have a time stamp of 7 UTC, and the other 40% have a timestamp with 8 UTC. We see absolutely no reviews with timestamps for other hours. 2011 is a unique case where we have almost all reviews posted during the 7-8 UTC with less than 1% posted at other hours. To get a closer look at 2011, I filtered the data to just 2011 and used months instead of years on the Y-axis.
Second Exploratory Visualization:
This graph reveals two important clues as to what is wrong with the dataset. The first is what the split between 7 and 8 values is from. 7 and 8 values each have a specific time of year where they are the only review time, switching in March and November. I realized that something else important happens with dates in March and November, daylight-savings time. I looked up the daylight-saving times for 2011 and it was from March 14th to November 6th we see those dates reflected exactly in the data. We even see that March is evenly split because the 14th is close to the middle of the month. In November the 6th is close to the beginning so we see an uneven split. The other thing that this graph shows us is that at some point in December the dates started to match the expected values. To get the most accurate value of the date that this happened I had to switch to looking at the day instead of month, and found that on December 27th all dates are 0 UTC and then after that they seem to be accurate to the minute.
Now that I had the issue clearly defined, I had to figure out why this was an issue in our data in the first place, and hopefully fix it. Instead of exploring our collected data, I saved a lot of time by going right to the source. I went on fanfiction.net, found some old reviews, inspected the webpage to find the UTC time stamp, and converted the timestamps into datetimes. I found that all the old reviews on the site were either 7 or 8 pm. I wasnât able to find an exact reason that the site is inaccurate, but I believe that when the ff.net backend was built in 2000 they decided to save some hard drive space by only saving dates by the day.Â
Here is 2015, a typical year, showing what the review distribution should look like. We can see people reviewing later in the day during summer and winter break. 8 UTC is midnight Pacific and 3 am Eastern, we see the lowest usage during the three following hours when many of the US reviewers are sleeping.Â
While creating these visualisations I learned:Â
1. Visualise both before and after processing data. Before calculating the month from the date, the visualisations didnât discover the inaccuracies. After splitting date into the year and hour of the day variables, the visualisations showed the problems with the dates.
2. Look deeper if something seems weird. When I first saw the problem by accident I almost dismissed it. Going off on a tangent ended up making a discovery that will be helpful for future research with this dataset.
3. Creating a presentation can help with findings. When creating a presentation to the group I built an interactive version of the graph (linked below). The interactive version showed that the data was missing a lot of reviews from when we were scraping the site in late 2016 to early 2017, another important thing to know when using this dataset. Â
You can see the code to create the visualizations here: https://travisneils.github.io/dates/dates_vis.html
You can find an interactive version here: Â https://travisneils.github.io/dates/date_chart.html
Hello! We are researchers at the University of Washington Human-Centered Data Science Lab investigating peopleâs participation in online fan communities like Fanfiction.net to better understand how people form communities in online environments. Weâd love to have you participate in our new survey.
Fanfiction Survey Link
Weâre looking for Fanfiction.net users aged 13 and above. Weâre interested in hearing from anyone who has used Fanfiction.net to read, review, or post fanfiction stories. You donât need to be a current user of Fanfiction.net - weâre also interested in hearing from people who used the site in the past. The survey contains 14 questions and you are not required to answer every question.Â
All parts of this survey were approved by the University of Washington Human Subjects Division Institutional Review Board (IRB) to ensure the protection of your rights and welfare as you take this survey. Your responses will be kept confidential, although we may publish aggregated results. You may exit the survey at any time.Â
For questions about our research, contact Niamh Froelich at [email protected].
Hello! We are researchers at the University of Washington Human-Centered Data Science Lab investigating peopleâs participation in online fan communities like Fanfiction.net to better understand how people form communities in online environments. Weâd love to have you participate in our new survey.
Fanfiction Survey Link
Weâre looking for Fanfiction.net users aged 13 and above. Weâre interested in hearing from anyone who has used Fanfiction.net to read, review, or post fanfiction stories. You donât need to be a current user of Fanfiction.net - weâre also interested in hearing from people who used the site in the past. The survey contains 14 questions and you are not required to answer every question.Â
All parts of this survey were approved by the University of Washington Human Subjects Division Institutional Review Board (IRB) to ensure the protection of your rights and welfare as you take this survey. Your responses will be kept confidential, although we may publish aggregated results. You may exit the survey at any time.Â
For questions about our research, contact Niamh Froelich at [email protected].
I started this project about seven months ago and I figured it might be time to reintroduce it to all our followers and hopefully introduce it to a few new people.Â
Letâs start with a bit of history:Â This is the post that started this project. Much to my surprise and amazement, that post was reblogged by @ao3commentofthedayâ a few days after we started accepting submissions, which helped this project immensely.Â
The idea behind Comment Exchange is to help all those fan creations that donât get imediate attention and quickly disappear between the countless other fan works that are out there. Especially new creators, creators who arenât active in the social part of fandom and creators in small fandoms rarely get high numbers of interactions on their works.Â
This tumblr is a place for all these works, you can submit your works here, or browse our different masterlists to find new fan content to consume. Preferably both.Â
We have about 175 fan works in almost 100 fandoms and welcome all forms of fan creations for all fandoms out there, hosted on any platform. You can also submit original works.Â
We run events and highlights form time to time so keep an eye out for those.Â
You can find a list of links to more information on this project under the cut.
A study on fanfiction storiesâ update frequency and number of reviews received
As a grad student, I often find myself debating over finishing tasks all at once or spacing them out over a reasonable time period. In the fanfiction community, we have seen stories where multiple chapters are posted on the same day, while others are updated every few months or even years. As authors, if our goal is to attract readers and reviews, how long should we wait between chapters? Is it better off to satisfy our readers with content all at once, or to keep them hooked by posting a bit at a time?
Our Approach
To answer these questions, we defined the âfrequencyâ of updates in a story as the average number of days between chapters posted, and looked at stories with more than one chapter and at least one review from fanfiction.net during the period of 1997 to 2017. In this particular study, we are considering the first story posted by each author to avoid miscalculating the accumulated review count for their subsequent stories. Note that the original chapter publish date/time was not available in the dataset so researchers estimated it from either the story publish time or time of the first review. As a result, this dataset is representative of stories with more reviews.Â
What We Found
In this graph, each data point is a story mapped to the total number of reviews received (y-axis) and average days between chapters posted (x-axis). The x-axis is then divided into 14 bins, to represent âbucketsâ of stories where chapters were posted every 1, 2, 3⌠14 days on average. While there are quite a few stories with up to hundreds of reviews, the median line plotted for each bin indicates that the data is skewed to the right.Â
My initial guess was that stories with chapters posted 3-4 days between each other might receive the most reviews, as readers are likely to revisit the same story for updates every few days. This graph seems to be consistent with this speculation and shows that the first peak is at five days. This means that half of the stories with chapters published five days between each other are observed to have 9 reviews. Other peaks are observed at ten and thirteen days.
Your Thoughts?
How often do YOU update a story? What factors do you consider when planning to post a new chapter? As a reader, would you prefer coming back every few days to read the new chapter and review, or reading them all at once? We look forward to seeing your comments and learning more on this topic!Â
Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.
â Live Streamingâ Interactive Chatâ Private Showsâ HD Quality
Anya is LIVE right now
FREE
Free to watch ⢠No registration required ⢠HD streaming
Building Connections through Shared Emotions on Fanfiction.net
Author: Sourojit Ghosh
As a creative writer myself, Iâve always been anxious about getting reviews on the content I put out there. As Iâm sure others who publish any form of writing can attest to, reviews form an integral part of our development as writers. However, I also find myself paying attention to not just what a review says, but also how it is said. Specifically, the emotions expressed in a review often shape my interpretation of it.
With that in mind, we at the University of Washington Human-Centered Data Science Lab (UW-HDSL) are interested in researching the emotions present in the multitude of reviews by the fanfiction community. By investigating a correlation between the lengths of reviews and the emotions expressed in them, we aim to understand the growth of relationships between members of the community as they share likes and dislikes.
Introduction
Our previous research with the fanfiction community has found widespread encouragement for budding relationships in its distributed-mentoring setting. The members of the community, mostly young adults from all over the world, are incredibly expressive in their words and often eager to support each other in the writing process. Most of the reviews we have seen in the community are rife with emotion, with the words jumping off the page with their expressiveness. This collectively supportive environment not only seeks to bring out the best in each individual but also to form meaningful relationships that extend beyond that of anonymous writers and readers of fanfiction.
Methods and Findings
For this exploration, we examined 1000 reviews of various fanfiction stories published on the site. We decided to classify them as exhibiting one of 11 emotions: Like, Joy/Happiness, Anticipation/Hope, Dislike, Discomfort/Disgust, Anger/Frustration, Sadness, Surprise, Confused,  Unknown, and No Emotion. Figure 1 shows an example of a review coded in this way using TextPrizm, a  web tool developed by members of the UW-HDSL.
Figure 1: An example of a review being coded for emotions
By coding these reviews for emotions, we are trying to gain a better understanding of the trends in emotions expressed by reviewers across the community. By identifying such trends, we hope to learn how relationships are formed between users sharing common interests and having similar reactions to certain content. Â
Figures 2 and 3 display our preliminary results so far. Figure 2 represents the number of reviews being classified as having each emotion, while Figure 3 shows the average lengths of reviews in the dataset expressing each emotion.
Figure 2: A bar graph showing the no. of reviews each emotion was assigned to.
Figure 3: A bar graph showing the average no. of words in a review expressing each emotion.
The high number of reviews expressing Joy / Happiness and Like is an encouraging indication of the fact that most users took adequate time to express their positivity and support towards the writers. Another emerging trend can be seen in the reviews marked as No Emotion. This small number of reviews averaging at about 80 words per review was found to contain thoughtful discussions on global issues like religious tensions and sexual violence. While the previously discussed reviews highlight the positivity inherent in the community, these reviews remind us of the incredible maturity and depth of thought that the members also possess, a fact even more inspiring given that the community is mostly comprised of young adults. Â
Conclusion and Future Work
This initial examination of a small set of reviews offers some insight into the correlations between emotions and review length. An exploration of a larger set of reviews may offer some basis for providing statistically significant findings along the lines of the currently observed trends and can provide further insight into the ways in which reviews are integral in the process of users on relationship building on Fanfiction.net.
We would love to hear from you, members of the fanfiction community, about what you think of our work and how you view the emotions expressed in reviews of your writing. At the same time, we would also be interested in knowing if you express certain emotions in your reviews more extensively than others! If you have any questions or concerns about our data, feel free to respond to this post or send up an ask, and we would be happy to get back to you. And, as always, stay tuned for our future work with your wonderful fanfiction community!
Acknowledgments
We are incredibly grateful to Dr. Cecilia Aragon and undergraduate researcher Niamh Froelich at the UW Human-Centered Data Science Lab for the initial ideas behind the project, their insightful feedback, and constant support throughout the process. We are also grateful for the fantastic Fanfiction.net community, which continues to prosper each day and exist as a positively supportive environment for budding and seasoned writers alike.
A time-shifted serial correlation analysis of reviewing and being reviewed.
Acknowledgements: Investigation by Arthur Liu with thanks to Dr. Cecilia Aragon and Jenna Frens for feedback and editing and also to team lead Niamh Froelich.
Is it true that giving someone a review will make that person more likely to write reviews as well? Conversely, is it true instead that writing more reviews yourself will help you get more reviews from others?
In this post, we explore one avenue of reciprocity by analyzing the time series of reviews given vs. reviews received.Â
Of course, you have to be careful with this technique. The inspiration of the analysis we utilized comes partly from Tyler Vigenâs Spurious Correlations site (http://www.tylervigen.com/spurious-correlations) where he shows interesting correlations between clearly unrelated events. With a humorous perspective, he reminds us that correlation is not evidence of causation (since sociology doctorates and rocket launches are totally coincidental), but the analysis techniques here are an interesting technique to investigate potential relationships between two different time series.
Back to our topic of reciprocity, we wanted to investigate the relationship between reviews given and reviews received. We had two hypotheses that we were interested in testing: first, we were curious if users who received more reviews would be more inclined to give reviews themselves. Second, we were curious if giving reviews would help increase the number of reviews you personally received.
To get into specifics, here is an example plot of a real userâs review activity.
Letâs break it down. This plot follows the activity of a single user over the course of several years. It plots the total amount of reviews that they gave (in red) and also the total number of reviews that they had received on their fan fictions (in blue). What this chart shows us is that this is a user who has had a very consistent amount of activity in terms of giving out reviews. It also captures spikes in the number of reviews received (blue) which may correspond to having released a new chapter.
If there was a strong link between reviews given and reviews received in either direction, we would expect to see that increases in one is followed by increases in the other. Here is an example where we witness such a relationship:
Since it is harder to analyze the change in activity level from these cumulative plots, we then looked at the total number of reviews given each month. Hereâs what that looks like for the same person:
This time, it is more apparent that there is a similar pattern in the activity behavior for the reviews given and reviews received. For this example, that similarity is a similar spiking pattern.
From Vigenâs website, we could naively apply a correlation calculation here, but there is a glaring flaw: one of the time series is clearly ahead of the other. So, what if we just shifted one of the time series so they overlapped and then computed the correlation? This is the basic intuition of serial correlation: we apply a range of possible shifts and then compute the correlation between these shifted graphs. The one with the highest correlation would be the one with the best match.
The results for different shifts:
The best shift of â11 framesâ:
In other words, for this person, giving a lot of reviews correlates well with receiving a lot of reviews roughly 11 months later. Of course, this doesnât prove any sort of causation, but we can speculate that the increased amount of reviews this user gave helped boost the amount of reviews they got later!
From this analysis of an individual person, we were curious how this extended to the larger community to see if these same trends existed! The short answer, âeh, not really,â but it is interesting to see why this cool pattern might not generalize adequately.
1. Not all individuals get reviews and give reviews at the same scale
Some users just like to give reviews and some users just like to write reviews!
For instance, here is someone who gives a lot of reviews and didnât get many themselves.
Here is someone who gave some reviews, but then focused on writing stories and received a lot more reviews instead!
For graphs like these, it is hard to apply the analysis we did earlier because the relationship is likely a lot weaker or there might just not be enough data points to capture it anyway.
We can summarize these examples for the overall population by looking at the ratio between reviews given to reviews received.
For this sample of 10k users, we see that those who primarily receive reviews will have a larger ratio (right), and users who primarily give reviews will have a smaller ratio (left). In more detail, a ratio of 1.0 means that they only received reviews. For example: 10 reviews received / (10 reviews received + 0 reviews given) = 1. For a ratio of 0.0, it means they received no reviews. For each ratio, the graph shows the total count of the 10k users who had that ratio.
To address issue (1), we reduced the scope down to users who had a relatively equal ratio of reviews given vs. reviews received.
Additionally, we pruned for users who had received at least 10 reviews. This way, we would have enough data points to use for our analysis. In fact, this is also why there is a large spike in the 0.5 ratio which consisted of a lot of users who had written one or two reviews and received an equal amount.
With this cleaned up, we also computed the lags on a finer scale--weeks--instead of months since we noticed that months were not granular enough. We computed the most common lags, and here is a plot of the results. This lag is the shift applied to received reviews, and the correlation is how well the two series correlated with each other after the shift. A correlation of 1 means that as one increased, the other increased as well, a correlation of -1 means that as one decreased, the other increased, and smaller values such as 0.8 mean that the correlation was positive, but less strong.
So the result here is both a little messier and structured than we had hoped from our hypothesis, but thatâs part of the research process!
To elaborate, in the X dimension, the lag, there isnât a particular range that was significantly denser than the rest. In fact, if we looked at the histogram, we see something like this:
So we lied a little, it looks like that last lag of +20 weeks looks really popular, but this is actually an artifact caused by the serial correlation process. If you recall this graph:
The red line is the chosen lag at the peak. In this case, the shifting actually peaked, but if we had truncated the graph at 5, it would have simply picked that highest shift.
Not convinced? Hereâs the same analytics, but now we calculated up to a lag of 40.
Looks like the 20 bucket wasnât particularly special after all.
So ignoring this last bucket (and the first bucket for a similar reason), we notice that our histogram matches this noisiness that we observed for the lags.
What does this mean? It suggests that there is no general pattern that can succinctly summarize the larger population, and that we are unable to conclude that there is a common average positive or negative lag relationship between the number of reviews someone has given and the number of reviews that they have received. Some authors sent more reviews after receiving more reviews (positive lags), some authors received more reviews after getting reviews (negative lags), and some authors did not exhibit much of a relationship either way (the first and last buckets which didnât find a reasonable shift). Although these relationships do exist, the timing was not consistent overall so we canât say anything about fanfiction.net authors in general.
So...
2. Looking across users, we do not see consistent behavior in a time-shifted relationship between a personâs received and given review count
Even when we look at the lags with the highest correlation (r > 0.7), we see that this even distribution of lags still holds.
In summary, this isnât the dead end! (With research, it rarely is!) But it helps paint a better picture of the users in the community and why this approach may not be well suited to encapsulate it well. We see that the relationship between reviews received and given doesnât follow a necessarily time-shifted relationship and that in fact, this shift can go either direction. Try taking a look at your own reviewing trends, and see where you would be located within these graphs! Are you someone who has a positive shift or a negative time shift⌠or no strong correlation at all?
In the meanwhile, weâre still exploring some other interesting approaches in reciprocity! Stay tuned :)
One of the questions we occasionally get from authors is: âWhat kinds of submissions get the most reviews?â We think this is a really interesting question and weâve started doing some exploratory analyses related to the quantity of reviews that authors receive based on a variety of factors. One of the factors that we decided to check out was the number of words in a chapter. We were curious: Would shorter chapters get more reviews because they might take less time to read? Or longer chapters because there is more for reviewers to dig into? Or maybe thereâs a sweet spot somewhere in between?
Methods
To look into this we took a random subset of 10,000 authors from FanFiction.net with chapter publications over a 20 year period from 1997 to 2017. We then created a scatterplot with each point being one of these 10,000 authors, the x-axis showing the median number of words across their published chapters, and the y-axis showing the median number of reviews received on those chapters. The points are segmented into six groups based on percentile of the total number of reviews received on all chapters they have ever published. We then put trendlines in for each of these segments, so we can more easily observe if there are any relationships between chapter length and reviews received across each of these groups. We also performed this analysis at the chapter with similar findings. The results are preliminary and warrant further exploration, but weâll share what weâve found so far.Â
Results
It turns out that the small number of most highly reviewed authors in the top 1% saw an increase in reviews received up until chapters of almost 5,000 words in length, at which point their chapters began to receive fewer reviews on average.
For those authors whose works are in the top 25% of reviews received (excluding the top 1%), as chapter length increases, the number of reviews received on those chapters does as well. Interestingly, there does not appear to be the same drop off in reviews received for longer stories for these authors as there was for the authors in the top 1% of reviews received.
On the other hand, the remaining authors whose chapters are less highly reviewed saw little change in the length of chapter published with the number of reviews received.
Conclusion
These preliminary results point to some interesting potential implications on how an author might be able to get the most reviews. For the most highly reviewed authors, shooting for a chapter of around 5,000 words in length is most likely to result in the highest levels of engagement. However, for the vast majority of authors, writing longer chapters is not likely to have a negative impact on engagement from reviewers, and may even result in more reviews.Â
How about you?
What are your experiences with receiving or providing reviews based on chapter length? Weâd love to hear whether this is a factor that motivates you or something that you consider when writing or reviewing!
On November 1st, me, @olderthannetfic and some of the great folks from @fanfictiondatascience are going to be doing a virtual @geekgirlcon panel on Research & Data Science in Fandom.
The panel will be on the conâs Twitch channel https://www.twitch.tv/geekgirlcon on Nov 1 at 1pm Pacific (4pm Eastern, 9pm UK).
My presentation includes some brand-new data analysis which will be posted here & on my AO3 in the next couple of weeks!
Hello! We are here again, the researchers at the University of Washington Human-Centered Data Science Lab, where we study fandom and fanfiction communities in order to better understand how people interact and engage with others in online environments. Weâve built a survey, and weâd love you all to participate!
Fanfiction Survey Link!
Weâre looking for fanfic authors of age 18 and above! This survey will ask you about your participation in fanfiction communities as an author. It is part of a project where we have created a prototype feedback tool created for the fanfiction community. It consists of visualizations based on reviews received by each of you as an author. The aim of the project is to help fanfiction authors better reflect on their data.
All parts of this survey were approved by the University of Washington Human Subjects Division Institutional Review Board (IRB) to ensure the protection of your rights and welfare as you take this survey. You may stop participating in the survey at any time. Your responses will be kept confidential, although we will publish aggregated results. Also, weâll ask if you are interested in a follow-up interview. The survey contains 6 to 14 questions in total depending on your answers. You are not required to answer every question.
For questions about our research, send us an ask or contact Netra Pathak at [email protected].
Thank you for your participation!!
Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.
â Live Streamingâ Interactive Chatâ Private Showsâ HD Quality
Anya is LIVE right now
FREE
Free to watch ⢠No registration required ⢠HD streaming
A Prototype Review Visualization Tool for the Fanfiction Community
Authors:Â Netra Pathak and Kush Tekriwal
Hey there! Weâre back, the researchers studying the fanfiction community at the University of Washingtonâs Human-Centered Data Science Lab. This time around, weâve created a prototype feedback tool that we hope will be helpful to the fanfiction community. The tool will contain dashboards with concise summary reports and trends of an authorâs reviews that may help the author reflect on their writing. We have a personal motivation to enhance the joy of writing, and are interested in hearing what authors think of our prototype tool.
Introducing the Concept
Weâve found the fanfiction community provides just the right kind of encouragement with its self-sustaining, distributed-mentoring setting. (Distributed mentoring differs from standard mentoring because itâs shared in small pieces by a large number of people.) This environment improves the writing of many authors but also boosts self-confidence. Hence, we thought that gathering review summaries and offering a reflection tool of all feedback received might be useful. This might help to further improve writing proficiency.
In this part of our study, the overarching research question we have is: âHow can visualizations help fanfiction authors further enhance their learning from reviews?â
Weâre interested in your feedback on this prototype visualization tool.
Our hypothesis is that providing an author with a holistic overview of all their reviews, customizable to the story or chapter level, may help the author glance over their work and synthesize areas of improvement. We believe learning from the feedback given in a distributed-mentoring community is important, and the technique of visual analytics (interactive visualizations combined with computation and analytical reasoning) can enable authors to recognize their strengths and weaknesses as writers. In addition, these reports may help authors understand why some chapters are received better than others and whether they have any other correlating factors such as time or other factors.
The tool could be extended to the fandom level, so authors could follow other author trends based on common fandoms, etc.
Background Information and Context of Data
We leveraged a dataset collected by the UW Human-Centered Data Science Lab that contains more than 176 million reviews from Fanfiction.net [2]. For our prototype analysis, we only used a subset of the data of authors and their stories and reviews.
For the purpose of analysis, we have machine-classified reviews into a few categories. The review classifications are generated by ALOE (Affect Labeler of Expressions), an open-source tool developed to train and test machine learning classifiers to automatically label chat messages with different emotions or affect categories [3].
In regard to this blog post, a review can fall into one or more of the 5 categories. Table 1 below provides a description for each of the 5 categories [1] and Table 2 provides sample reviews for each of the 5 categories.
Review Trend Dashboards in the Tool
Below are the screenshots of some of the dashboard screens in the feedback tool. Through these dashboards, we hope each author can explore the story of their journey in the fanfiction community. Please be informed that as the data is sensitive, we have anonymized our results.
Differential privacy techniques have been used and the number of reviews in all figures do not represent any individual authorâs actual count. Also, in Fig 4 and subsequent figures, the story ID and/or author ID do not represent the actual ID on fanfiction.net.
The first three screenshots focus on review types and trends of an individual author over time. We thought it would be interesting for authors to see the trends of the types of reviews they have been receiving over the entire year or on a weekly/monthly basis. This can enable them to analyze their peaks and dips, relate them to any external events, etc.
Fig 1: Overall review trend for one particular story of an author based on different review types over a time period of one year. (The trend can also be seen for all stories together, where the number of reviews equals the sum of all reviews of all stories.) Hovering over a data point gives the details in a tooltip.
In Fig 2, Fig 3 and Fig 6, stacked bar charts are used to show a larger category divided into smaller categories and what the relationship of each part has on the total amount. For example, different review categories as part of reviews received over a month (i.e. a larger category). In that case, each bar represents a whole (all reviews received in a month/week), and segments in the bar represent different parts (review categories) of that whole. Hovering over a segment of the bar chart highlights details specific to the segment.
Fig 2: Review type breakdown of all the stories of a particular author over time (weekly). Time can be customized to be at a weekly, monthly or yearly level. Please note, the review categories here are not mutually exclusive which results in an increased number of reviews for a few types.
Fig 3: Review type breakdown of the stories of a particular author over time, with the review categories being mutually exclusive. Time can be customized to be at a weekly, monthly or yearly level.Â
Now, combining the above screens in one dashboard, we can either see the review breakdown and its trend for all stories together or for each story differently. For each story, we can also see the estimated chapter published dates and link them to the review dates. Hence, this way the dashboard is customizable to reflect either all stories or at story/chapter level.
Fig 4: The dashboard contains the review breakdown in multiple categories, as well as the estimated chapter published date for a single story of an author. The above results are for a particular author ID 317330 (all IDs are anonymized) for a single story ID 936798 (blue highlighted) and similarly, we can see for each of the individual story IDs or for all stories together (see Fig 5 below).Â
Fig 5: The dashboard contains the review breakdown in multiple categories, as well as the estimated chapter published date for stories of an author. These stories are ordered by the number of reviews received by that particular story of the author. Here, it may be assumed that the stories that have received the highest number of reviews are the popular stories for the author.
The final dashboard below enables authors to see at a glance the number of reviews of each of their stories, while also being able to juxtapose their stories. Every author will have stories that receive more reviews and ones that receive fewer, and these dashboards may give them the ability to learn which story characteristics may lead to a greater number of reviews.
Fig 6: The dashboard gives informative review details for all the stories of an author. We can see the number of reviews received monthly and the review categories breakdown for each story of an author. This dashboard potentially gives the ability to analyze which stories were a success and received a lot of update encouragement and positive feedback, while on the other hand, which stories received critical acclaim, constructive feedback, etc.
Does Analysis Matter?
There is an obvious question in mind while seeing these visualizations and data trends: How does the analysis help? How is this reflection beneficial? Just like how customer feedback is crucial for future product development and improvement, no matter the size of the organization; similarly it doesnât matter if I am an author starting out, a well-versed author mid-way in my writing experience, or a proficient author. Analysis provides a better view of what needs to be changed or improved, if any, whether you are an individual, or represent a group, business or company. Such information can be used to make informed decisions. For example, in the context of fanfiction, for a starter it may be useful to know what kind of stories are read and reviewed more and why, what kind of plots are acknowledged more, etc. For an author who has written multiple stories, it may be useful to know which stories received maximum appreciation to continue using similar components and keep up his/her fanbase.
However, all said and done, these are just our speculations! We want to know what you think! We want to know from you if such analysis is helpful to the fanfiction authors, or if you would like some changes. We would love to pivot in the direction that is most useful for you.
Thatâs a Wrap
As we deliver this system of dashboards, we hope to create a positive impact by highlighting the trends and summary reports of review types for the stories of an author. For example, new authors in the community may be able to observe trends such as an increasing number of update encouragement reviews and in turn might feel encouraged to write more. :D
The tool and the dashboards are a medium to see feedback from other authors and readers over time.
We will also be encouraged if we get feedback from you. Please share your thoughts and comments so we can learn about your likings as well! To validate our research, we would also love to work with members in the fanfiction community and know whether our solution is effective or not. We would like to extend this work based on the responses we receive.
This is it for now! In the coming months we will develop more dashboards and post them as there are a plethora of questions we can ask this data. Heartfelt thanks for taking a look at our prototype. If you have any questions or want clarification on any of the data, please donât hesitate to reply to this post, reblog with a comment, or send an ask. Weâll be happy to clear up any confusion the best we can!
Acknowledgments
We would like to express our deepest gratitude towards Prof. Cecilia Aragon and Jenna Frens at the Human-Centered Data Science Lab for their useful critiques, ideas, constant guidance and enthusiastic encouragement of this research study. It was an honor to work with them.
Additional Information
Earlier research in our group, published in the paper âMore Than Peer Production: Fanfiction Communities as Sites of Distributed Mentoringâ has outlined 13 categories that were observed in Fanfiction.net reviews [1].Â
1. Evans, S., Davis, K., Evans, A., Campbell, J. A., Randall, D. P., Yin, K., & Aragon, C. (2017, February). More than peer production: fanfiction communities as sites of distributed mentoring. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing(pp. 259-272). ACM.
2. Aragon C. Human-Centered Data Science Lab  Distributed Mentoring in Fanfiction Communities. https://depts.washington.edu/hdsl/research/distributed-mentoring/. Published 2019. Accessed June 5, 2019.
3. Brooks M. etcgroup/aloe. GitHub. https://github.com/etcgroup/aloe.Â
4. University of Washington, Human-Centered Data Science Lab Âť Research ÂťDistributed Mentoring in Fanfiction Communities. https://depts.washington.edu/hdsl/research/distributed-mentoring/
(passing note from non tumblr friend): Hi! I was at the data science in fandom panel yesterday, and really enjoyed it. Jenna and Ruby were awesome, and the numbers were super interesting. I realized later I had two follow up questions:1. Why did you guys choose to scrape ffn instead of another site or multiple sites?2. Are any of the panelists planning to post the PowerPoint anywhere?
Thank you!!!
(1) FFN is an older site, which was helpful because it was technically more simple to scrape and also contains a rich data from decades of posting
(2) Good question! I believe @destinationtoast has a round-up posted from GeekGirlCon. Iâll work on getting our slides up.
-Jenna
FanFiction Data Science @fanfictiondatascience - Tumblr Blog | Tumlook