{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "### The Impact of Scale on Content Analysis of Goodreads Reviews\n", "\n", "- We use content analysis: quantitative method for analysing the content of reviews\n", "- Subsets of reviews with different types of focus and different scales (from 1 to 100 to 10,000 to 1 million reviews)\n" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "/Users/marijnkoolen/Code/Huygens/scale\n" ] } ], "source": [ "# This reload library is just used for developing the REPUBLIC hOCR parser \n", "# and can be removed once this module is stable.\n", "%reload_ext autoreload\n", "%autoreload 2\n", "\n", "# This is needed to add the repo dir to the path so jupyter\n", "# can load the modules in the scripts directory from the notebooks\n", "import os\n", "import sys\n", "repo_dir = os.path.split(os.getcwd())[0]\n", "print(repo_dir)\n", "if repo_dir not in sys.path:\n", " sys.path.append(repo_dir)\n", " \n", "import numpy as np\n", "import pandas as pd\n", "import matplotlib.pyplot as plt\n", "import json\n", "import csv\n", "import os\n", "\n", "data_dir = '../data/GoodReads'\n", "\n", "books_10k_file = os.path.join(data_dir, 'goodreads_reviews-books_above_10k_lang_reviews.csv.gz')\n", "reviewers_5k_file = os.path.join(data_dir, 'goodreads_reviews-reviewers_above_5k_reviews.csv.gz')\n", "random_1M_file = os.path.join(data_dir, 'goodreads_reviews-random_sample_1M.csv.gz')\n", "author_file = os.path.join(data_dir, 'goodreads_book_authors.csv.gz') # author information\n", "book_file = os.path.join(data_dir, 'goodreads_books.csv.gz') # basic book metadata\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Loading and Merging Data\n", "\n", "We start with a subset of reviews for frequently reviewed books. To see how this subset was created, go to the [Filtering Goodreads reviews](./Filtering-Goodreads-Reviews.ipynb) notebook. This subset contains all reviews for books that have at least 10,000 reviews each. \n", "\n", "We first load the reviews into a Pandas dataframe, then add metadata for the reviewed books from some of the datasets with book metadata." ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", " | Unnamed: 0 | \n", "user_id | \n", "book_id | \n", "review_id | \n", "rating | \n", "date_added | \n", "date_updated | \n", "read_at | \n", "started_at | \n", "n_votes | \n", "n_comments | \n", "review_length | \n", "review_text | \n", "author_id | \n", "title | \n", "author_name | \n", "review_lang | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "0 | \n", "8842281e1d1347389f2ab93d60773d4d | \n", "2767052 | \n", "248c011811e945eca861b5c31a549291 | \n", "5 | \n", "Wed Jan 13 13:38:25 -0800 2010 | \n", "Wed Mar 22 11:46:36 -0700 2017 | \n", "Sun Mar 25 00:00:00 -0700 2012 | \n", "Fri Mar 23 00:00:00 -0700 2012 | \n", "24 | \n", "25 | \n", "1326 | \n", "I cracked and finally picked this up. Very enj... | \n", "153394 | \n", "The Hunger Games (The Hunger Games, #1) | \n", "Suzanne Collins | \n", "en | \n", "
1 | \n", "1 | \n", "704eb93a316aff687a93d5215882eb21 | \n", "2767052 | \n", "c52e231744768e9d7f939d1cbeb87666 | \n", "5 | \n", "Fri Jul 20 13:59:12 -0700 2012 | \n", "Sun Aug 23 20:49:13 -0700 2015 | \n", "Sat Feb 18 00:00:00 -0800 2012 | \n", "NaN | \n", "0 | \n", "0 | \n", "31 | \n", "Exciting, fun, entertaining! :) | \n", "153394 | \n", "The Hunger Games (The Hunger Games, #1) | \n", "Suzanne Collins | \n", "en | \n", "
2 | \n", "2 | \n", "4b3636a043e5c99fa27ac897ccfa1151 | \n", "2767052 | \n", "89f5c6ed51ba6f70d3955a620f9af830 | \n", "5 | \n", "Thu Jun 09 22:05:49 -0700 2011 | \n", "Fri Sep 13 08:47:42 -0700 2013 | \n", "Tue Jul 05 00:00:00 -0700 2011 | \n", "Mon Jul 04 00:00:00 -0700 2011 | \n", "0 | \n", "0 | \n", "201 | \n", "This was the perfect quick read for a beach va... | \n", "153394 | \n", "The Hunger Games (The Hunger Games, #1) | \n", "Suzanne Collins | \n", "en | \n", "
3 | \n", "3 | \n", "012aa353140af13109d00ca36cdc0637 | \n", "2767052 | \n", "77fa951667b104fd565d5bd6c760437b | \n", "5 | \n", "Sun Nov 04 18:57:00 -0800 2012 | \n", "Mon Apr 15 12:57:23 -0700 2013 | \n", "Sun Apr 14 00:00:00 -0700 2013 | \n", "NaN | \n", "0 | \n", "0 | \n", "1523 | \n", "The United States (and I assume most other soc... | \n", "153394 | \n", "The Hunger Games (The Hunger Games, #1) | \n", "Suzanne Collins | \n", "en | \n", "
4 | \n", "4 | \n", "2f6af21d14c83a5df6cdcef5e6af0b3e | \n", "2767052 | \n", "46f876086c1e378859f889e87d1e6e5c | \n", "4 | \n", "Thu Jun 07 10:31:00 -0700 2012 | \n", "Thu Jun 07 10:33:17 -0700 2012 | \n", "Mon Apr 16 00:00:00 -0700 2012 | \n", "NaN | \n", "0 | \n", "0 | \n", "98 | \n", "A page turner. Since I hate reality TV I value... | \n", "153394 | \n", "The Hunger Games (The Hunger Games, #1) | \n", "Suzanne Collins | \n", "en | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
121925 | \n", "121972 | \n", "d168e4a91a8cb0795d72d0adbe9a5897 | \n", "10818853 | \n", "a72358e15220c703fbcd1a61ceb60ea6 | \n", "3 | \n", "Tue Aug 06 16:05:58 -0700 2013 | \n", "Tue Aug 06 16:06:37 -0700 2013 | \n", "NaN | \n", "NaN | \n", "0 | \n", "0 | \n", "107 | \n", "Very shocking content. Not well written. Makes... | \n", "4725841 | \n", "Fifty Shades of Grey (Fifty Shades, #1) | \n", "E.L. James | \n", "en | \n", "
121926 | \n", "121973 | \n", "d43b94b7a0a02e0bbaa6b93b884a0c9d | \n", "10818853 | \n", "f35af15602f353e3c4b8b357ca2cfd01 | \n", "4 | \n", "Sat Jun 16 04:02:33 -0700 2012 | \n", "Sat Jun 16 04:03:52 -0700 2012 | \n", "Fri Jun 08 00:00:00 -0700 2012 | \n", "NaN | \n", "1 | \n", "0 | \n", "45 | \n", "A wonderful, if slightly twisted, love story. | \n", "4725841 | \n", "Fifty Shades of Grey (Fifty Shades, #1) | \n", "E.L. James | \n", "en | \n", "
121927 | \n", "121974 | \n", "43202656e9c338bb711afbc7136ab344 | \n", "10818853 | \n", "0931f46ea40d06bb201410a1c465b2ff | \n", "2 | \n", "Sun Nov 11 01:28:33 -0800 2012 | \n", "Sun Nov 11 01:29:46 -0800 2012 | \n", "NaN | \n", "NaN | \n", "0 | \n", "0 | \n", "118 | \n", "Read to see what all the hype was about. Mills... | \n", "4725841 | \n", "Fifty Shades of Grey (Fifty Shades, #1) | \n", "E.L. James | \n", "en | \n", "
121928 | \n", "121975 | \n", "d94c83867337514c94738b57a1d19677 | \n", "10818853 | \n", "bf6e6e995804cd92d2e0f66a0fe4c5d8 | \n", "5 | \n", "Sat Sep 08 09:20:43 -0700 2012 | \n", "Wed Dec 26 03:13:01 -0800 2012 | \n", "NaN | \n", "NaN | \n", "0 | \n", "0 | \n", "296 | \n", "This book killed the little innocence in me. I... | \n", "4725841 | \n", "Fifty Shades of Grey (Fifty Shades, #1) | \n", "E.L. James | \n", "en | \n", "
121929 | \n", "121976 | \n", "e60fcbb1c70ed4f383145efcae21c7ac | \n", "10818853 | \n", "6b298c960776d63607d06023ad38b567 | \n", "4 | \n", "Tue Jul 21 03:53:31 -0700 2015 | \n", "Sun Jul 26 09:25:26 -0700 2015 | \n", "Fri Jul 24 00:00:00 -0700 2015 | \n", "Tue Jul 21 00:00:00 -0700 2015 | \n", "0 | \n", "0 | \n", "274 | \n", "I actually to my own surprise, enjoyed this bo... | \n", "4725841 | \n", "Fifty Shades of Grey (Fifty Shades, #1) | \n", "E.L. James | \n", "en | \n", "
121930 rows × 17 columns
\n", "\n", " | dependency_type | \n", "dependency_word | \n", "dependency_pos | \n", "dependency_freq | \n", "tail_word | \n", "tail_pos | \n", "tail_freq | \n", "dep_tail_freq | \n", "liwc_category | \n", "
---|---|---|---|---|---|---|---|---|---|
19237 | \n", "head | \n", "bad | \n", "ADJ | \n", "324 | \n", "year | \n", "NOUN | \n", "102 | \n", "1 | \n", "relativ|time | \n", "
19239 | \n", "head | \n", "bad | \n", "ADJ | \n", "324 | \n", "character | \n", "NOUN | \n", "1071 | \n", "1 | \n", "None | \n", "
43829 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "book | \n", "NOUN | \n", "3681 | \n", "20 | \n", "None | \n", "
43830 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "term | \n", "NOUN | \n", "24 | \n", "1 | \n", "quant|relativ|time | \n", "
43832 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "thing | \n", "NOUN | \n", "519 | \n", "13 | \n", "None | \n", "
43833 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "part | \n", "NOUN | \n", "71 | \n", "8 | \n", "funct|quant | \n", "
43834 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "mood | \n", "NOUN | \n", "10 | \n", "2 | \n", "affect | \n", "
43835 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "outcome | \n", "NOUN | \n", "15 | \n", "1 | \n", "None | \n", "
43836 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "ending | \n", "NOUN | \n", "565 | \n", "10 | \n", "relativ|time | \n", "
43838 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "writing | \n", "NOUN | \n", "133 | \n", "4 | \n", "social|cogmech | \n", "
43843 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "guy | \n", "NOUN | \n", "75 | \n", "10 | \n", "None | \n", "
43845 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "choice | \n", "NOUN | \n", "82 | \n", "2 | \n", "None | \n", "
43846 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "memory | \n", "NOUN | \n", "29 | \n", "2 | \n", "None | \n", "
43848 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "person | \n", "NOUN | \n", "134 | \n", "3 | \n", "social|humans | \n", "
43850 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "read | \n", "NOUN | \n", "65 | \n", "1 | \n", "work|leisure | \n", "
43851 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "case | \n", "NOUN | \n", "35 | \n", "1 | \n", "None | \n", "
43852 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "triangle | \n", "NOUN | \n", "142 | \n", "1 | \n", "None | \n", "
43853 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "interest | \n", "NOUN | \n", "47 | \n", "1 | \n", "None | \n", "
43855 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "situation | \n", "NOUN | \n", "43 | \n", "1 | \n", "None | \n", "
43856 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "people | \n", "NOUN | \n", "385 | \n", "1 | \n", "None | \n", "
43860 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "sequel | \n", "NOUN | \n", "18 | \n", "1 | \n", "None | \n", "
43862 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "conclusion | \n", "NOUN | \n", "95 | \n", "3 | \n", "None | \n", "
43864 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "reason | \n", "NOUN | \n", "161 | \n", "1 | \n", "None | \n", "
43865 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "taste | \n", "NOUN | \n", "19 | \n", "4 | \n", "None | \n", "
43866 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "aspect | \n", "NOUN | \n", "27 | \n", "1 | \n", "None | \n", "
43867 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "loss | \n", "NOUN | \n", "24 | \n", "1 | \n", "None | \n", "
43869 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "ass | \n", "NOUN | \n", "22 | \n", "4 | \n", "swear|bio|body|sexual | \n", "
43870 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "stuff | \n", "NOUN | \n", "32 | \n", "1 | \n", "funct|pronoun|ipron | \n", "
43873 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "model | \n", "NOUN | \n", "12 | \n", "1 | \n", "None | \n", "
43874 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "good | \n", "NOUN | \n", "13 | \n", "1 | \n", "affect|posemo | \n", "
43875 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "idea | \n", "NOUN | \n", "119 | \n", "1 | \n", "cogmech|insight | \n", "
43876 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "decision | \n", "NOUN | \n", "95 | \n", "2 | \n", "None | \n", "
43877 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "death | \n", "NOUN | \n", "371 | \n", "1 | \n", "None | \n", "
43881 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "dialogue | \n", "NOUN | \n", "14 | \n", "1 | \n", "None | \n", "
43882 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "game | \n", "NOUN | \n", "187 | \n", "1 | \n", "None | \n", "
43883 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "series | \n", "NOUN | \n", "925 | \n", "1 | \n", "quant | \n", "
43884 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "way | \n", "NOUN | \n", "516 | \n", "1 | \n", "relativ | \n", "
43885 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "movie | \n", "NOUN | \n", "149 | \n", "1 | \n", "None | \n", "
43887 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "sentence | \n", "NOUN | \n", "44 | \n", "1 | \n", "None | \n", "
43888 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "boy | \n", "NOUN | \n", "54 | \n", "1 | \n", "social|humans | \n", "
43889 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "one | \n", "NOUN | \n", "45 | \n", "1 | \n", "funct|number | \n", "
43892 | \n", "child | \n", "bad | \n", "ADJ | \n", "324 | \n", "trilogy | \n", "NOUN | \n", "304 | \n", "1 | \n", "None | \n", "
\n", " | dependency_type | \n", "dependency_word | \n", "dependency_pos | \n", "dependency_freq | \n", "tail_word | \n", "tail_pos | \n", "tail_freq | \n", "dep_tail_freq | \n", "liwc_category | \n", "
---|---|---|---|---|---|---|---|---|---|
218 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "eighth | \n", "ADJ | \n", "1 | \n", "1 | \n", "None | \n", "
220 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "sappy | \n", "ADJ | \n", "2 | \n", "1 | \n", "None | \n", "
221 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "disjointed | \n", "ADJ | \n", "5 | \n", "1 | \n", "None | \n", "
223 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "government | \n", "ADJ | \n", "1 | \n", "1 | \n", "None | \n", "
224 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "exceptional | \n", "ADJ | \n", "5 | \n", "1 | \n", "None | \n", "
226 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "middle | \n", "ADJ | \n", "5 | \n", "1 | \n", "None | \n", "
227 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "lengthy | \n", "ADJ | \n", "2 | \n", "1 | \n", "None | \n", "
229 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "later | \n", "ADJ | \n", "2 | \n", "1 | \n", "relativ|time | \n", "
232 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "remarkable | \n", "ADJ | \n", "3 | \n", "1 | \n", "None | \n", "
233 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "separate | \n", "ADJ | \n", "2 | \n", "1 | \n", "None | \n", "
234 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "war | \n", "ADJ | \n", "4 | \n", "1 | \n", "affect|negemo|anger|death | \n", "
235 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "adequate | \n", "ADJ | \n", "2 | \n", "1 | \n", "None | \n", "
236 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "dull | \n", "ADJ | \n", "4 | \n", "1 | \n", "None | \n", "
237 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "special | \n", "ADJ | \n", "2 | \n", "1 | \n", "affect|posemo | \n", "
240 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "triumphant | \n", "ADJ | \n", "1 | \n", "1 | \n", "None | \n", "
242 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "audio | \n", "ADJ | \n", "2 | \n", "2 | \n", "None | \n", "
245 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "best | \n", "ADJ | \n", "2 | \n", "1 | \n", "funct|quant|affect|posemo|achieve | \n", "
249 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "7th | \n", "ADJ | \n", "1 | \n", "1 | \n", "None | \n", "
250 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "light | \n", "ADJ | \n", "3 | \n", "1 | \n", "percept | \n", "
251 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "fourth | \n", "ADJ | \n", "1 | \n", "1 | \n", "None | \n", "
255 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "dystopic | \n", "ADJ | \n", "2 | \n", "1 | \n", "None | \n", "
256 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "expletive | \n", "ADJ | \n", "1 | \n", "1 | \n", "None | \n", "
258 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "pretty | \n", "ADJ | \n", "5 | \n", "1 | \n", "affect|posemo|cogmech|tentat | \n", "
259 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "paced | \n", "ADJ | \n", "5 | \n", "1 | \n", "None | \n", "
260 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "thrilling | \n", "ADJ | \n", "5 | \n", "1 | \n", "None | \n", "
263 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "anticipated | \n", "ADJ | \n", "1 | \n", "1 | \n", "None | \n", "
269 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "darned | \n", "ADJ | \n", "1 | \n", "1 | \n", "None | \n", "
270 | \n", "head | \n", "book | \n", "NOUN | \n", "1630 | \n", "previious | \n", "ADJ | \n", "1 | \n", "1 | \n", "None | \n", "
5111 | \n", "child | \n", "book | \n", "NOUN | \n", "1630 | \n", "intriguing | \n", "ADJ | \n", "1 | \n", "1 | \n", "None | \n", "
\n", " | dependency_type | \n", "dependency_word | \n", "dependency_pos | \n", "dependency_freq | \n", "tail_word | \n", "tail_pos | \n", "tail_freq | \n", "dep_tail_freq | \n", "liwc_category | \n", "
---|---|---|---|---|---|---|---|---|---|
2501 | \n", "head | \n", "describe | \n", "VERB | \n", "20 | \n", "describe | \n", "VERB | \n", "20 | \n", "3 | \n", "verb|present|social | \n", "
2502 | \n", "head | \n", "describe | \n", "VERB | \n", "20 | \n", "event | \n", "NOUN | \n", "40 | \n", "1 | \n", "relativ|time | \n", "
2503 | \n", "head | \n", "describe | \n", "VERB | \n", "20 | \n", "feeling | \n", "NOUN | \n", "50 | \n", "1 | \n", "None | \n", "
2504 | \n", "head | \n", "describe | \n", "VERB | \n", "20 | \n", "theme | \n", "NOUN | \n", "19 | \n", "1 | \n", "None | \n", "
2505 | \n", "head | \n", "describe | \n", "VERB | \n", "20 | \n", "scene | \n", "NOUN | \n", "30 | \n", "1 | \n", "None | \n", "
2506 | \n", "head | \n", "describe | \n", "VERB | \n", "20 | \n", "explain | \n", "VERB | \n", "29 | \n", "1 | \n", "verb|present|social|cogmech|insight | \n", "
9015 | \n", "child | \n", "describe | \n", "VERB | \n", "20 | \n", "word | \n", "NOUN | \n", "42 | \n", "3 | \n", "None | \n", "
9016 | \n", "child | \n", "describe | \n", "VERB | \n", "20 | \n", "begin | \n", "VERB | \n", "35 | \n", "1 | \n", "verb|present|relativ|time | \n", "
9017 | \n", "child | \n", "describe | \n", "VERB | \n", "20 | \n", "reality | \n", "NOUN | \n", "19 | \n", "1 | \n", "cogmech|certain | \n", "
9018 | \n", "child | \n", "describe | \n", "VERB | \n", "20 | \n", "pull | \n", "VERB | \n", "11 | \n", "1 | \n", "None | \n", "