Data from RusAge (https://www.kaggle.com/oldaandozerskaya/fiction-corpus-for-agebased-text-classification)
RusAge: Corpus for Age-Based Text Classification Russian fiction books' previews with age rating labels.
import pandas as pd
df = pd.read_csv('RusAge.csv', sep=';', names=["filename","book_title","author","age_rating","genres"])
df.head()
df.count()
df[0:50]
df['age_rating'].value_counts().plot(kind='bar')
df['author'].value_counts().plot(kind='bar', figsize=(25,8))
df['author'].value_counts()[df['author'].value_counts() > 20].plot(kind='bar', figsize=(20,8))
df['genres'].value_counts().plot(kind='bar', figsize=(15,8))
filt = (df['age_rating']) == 12
genres_count = df.loc[filt, 'genres'].value_counts()
genres_count
Sherratt, Tim. (2019, November 17). GLAM-Workbench/csv-explorer (Version v0.1.0). Zenodo. http://doi.org/10.5281/zenodo.3544712
RusAge (https://www.kaggle.com/oldaandozerskaya/fiction-corpus-for-agebased-text-classification)