Wybo is a sociologist and PhD/DPhil student at the Oxford Internet Institute.

He studies online social behaviour, especially how social media affect protest movements such as the Arab Spring and Occupy Wall Street.

He also holds an MSc in Social Science of the Internet, an MA in Digital Humanities (distinction), and three BAs (firsts/cum laude) in History, Philosophy of Information Science, and Information Science.

A Measure of Aggregate Syntactic Distance

(There is a later paper on this)

We compare vectors containing counts of trigrams of part-of-speech (POS) tags in order to obtain an aggregate measure of syntax difference. Since lexical syntactic categories reflect more abstract syntax as well, we argue that this procedure reflects more than just the basic syntactic categories.
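As a rough illustration of what such count vectors might look like, the sketch below counts POS trigrams in sentences that are assumed to be already tagged and aligns two sets of counts into comparable vectors. The function names and the choice to count within sentence boundaries are assumptions made for this example, not details taken from the paper.

```python
from collections import Counter


def pos_trigram_counts(tag_sequences):
    """Count POS-tag trigrams; each item is the tag list of one sentence.

    Assumption: trigrams do not cross sentence boundaries.
    """
    counts = Counter()
    for tags in tag_sequences:
        for i in range(len(tags) - 2):
            counts[tuple(tags[i:i + 3])] += 1
    return counts


def to_vectors(counts_a, counts_b):
    """Align two trigram counters on a shared, sorted list of trigrams."""
    keys = sorted(set(counts_a) | set(counts_b))
    # A Counter returns 0 for a trigram it has never seen.
    return keys, [counts_a[k] for k in keys], [counts_b[k] for k in keys]


# Hypothetical tagged sentences, for illustration only.
group_a = pos_trigram_counts([["DT", "NN", "VBZ", "DT", "NN"]])
group_b = pos_trigram_counts([["DT", "NN", "VBZ"]])
keys, vec_a, vec_b = to_vectors(group_a, group_b)
```

Each position in `vec_a` and `vec_b` then refers to the same trigram, so the two vectors can be compared directly.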

We tag the material automatically and analyze the frequency vectors for POS trigrams using a permutation test.
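A permutation test of this kind could be sketched as follows. The distance used here (the summed absolute difference of relative trigram frequencies), the sentence-level shuffling, and all function names are assumptions chosen for the example; the paper's own measure and resampling scheme may differ.

```python
import random
from collections import Counter


def trigram_counts(tag_sequences):
    """Count POS-tag trigrams within each sentence (a list of tags)."""
    counts = Counter()
    for tags in tag_sequences:
        counts.update(zip(tags, tags[1:], tags[2:]))
    return counts


def distance(group_a, group_b):
    """Assumed measure: summed absolute difference of relative frequencies."""
    ca, cb = trigram_counts(group_a), trigram_counts(group_b)
    total_a = sum(ca.values()) or 1  # avoid division by zero
    total_b = sum(cb.values()) or 1
    return sum(abs(ca[k] / total_a - cb[k] / total_b)
               for k in set(ca) | set(cb))


def permutation_test(group_a, group_b, n_permutations=1000, seed=0):
    """Return the observed distance and an estimated p-value."""
    rng = random.Random(seed)
    observed = distance(group_a, group_b)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    at_least_as_large = 0
    for _ in range(n_permutations):
        # Reassign sentences to the two groups at random.
        rng.shuffle(pooled)
        if distance(pooled[:n_a], pooled[n_a:]) >= observed:
            at_least_as_large += 1
    # Add-one correction so the estimate is never exactly zero.
    p_value = (at_least_as_large + 1) / (n_permutations + 1)
    return observed, p_value
```

If the two groups really differ in syntax, random reassignment of sentences should rarely produce a distance as large as the observed one, which yields a small p-value.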

A test analysis of a 305,000-word corpus containing the English of Finnish emigrants to Australia is promising: the proposed procedure works well in distinguishing two groups (adult vs. child emigrants) and also in highlighting syntactic deviations between them.