Tuesday, October 20, 2020

Detail #402: Gender-congruent Nouns under Possession

Gender congruence in adjectives is sort of old-hat. However, let's consider a gender congruence that affects the gender of the main noun instead.¹

Let's assume some nouns, such as "looks", "face", "appearance" - nouns that simply have to do with the visuals of a person - and maybe nouns that have to do with the mental facets of a person - "soul", "mind", "mood", etc - change gender whenever they are possessums.

brother-masc.gen face.masc

sister-fem.gen face.fem 

However, in isolation there is a default gender these nouns take. Maybe they're all neuters or something. 

This far, the idea isn't particularly complex or anything - and this seems to be a realistic quirk whereby some nouns denoting certain aspects of a person agrees in gender with that person - nothing weird at all.

How about plurals? Here, we can imagine several possible situations.

1. If the language conflates genders in the plural

Maybe this forces coordinations in cases where mixed gender is implied:

their face-masc and face-fem

maybe

their face-masc.pl and face-fem.pl

or maybe singulars are used for these nouns - thus making them overspecify gender while underspecifying number.

And maybe this also permits shared things: their appearance.sg, although maybe there is a forced plural congruence so you must say "their appearance.pl" even if the intended meaning is singular.

If one wants to say that someone has multiple faces, maybe using forced discongruence and coordination could be a solution:

he has face-masc.acc and face-fem.acc
he is two-faced

However, this imho sounds like claiming he does drag, and so

he has face-masc.acc and face-masc.acc

seems more like it would imply twofacedness. "He has faces" would sort of be blocked by the congruence rule - although one could imagine that the use of incongruent nouns in this case would convey a rather marked meaning - viz. that of having two (conflicting) faces.

2. If the language distinguishes genders in the plural

Here, we can imagine more combinations: their.fem appearance.sg.fem implies a shared appearance, their.fem appearance.pl.fem implies the appearances of each of them taken as a group. Number discongruence with gender congruence may be used to indicate a variety of things (a person of many faces, or many people of one appearance), and gender discongruence would probably not be used all that much.


Finally, if the nouns normally are neuter, one could imagine that masculine or feminine forms are used as a sort of derivation that implies, say female or male looks, faces, etc in specific without necessarily talking about a specific face.

Finally, a question: can we create a distribution that is "alignment-like" out of this idea?


¹ Yes, I am aware that in some languages, syntactic analysis indicates that the genitive noun is higher up in the syntactic hierarchy, but let's ignore that for now.

Friday, October 9, 2020

Real Language Examples: The Case of the Complement and Logic

Preamble

I recently started a blog in Swedish, where I discuss - from a serious linguistically informed point of view - some of the common prescriptive notions that Swedish prescriptivists like advancing. It often turns out that their reasoning and their underlying claims are flawed. The approach consists of cutting out the illogical fluff that besets the debate, and focus on which actual arguments are valid.

This post is a translation/rework of such a post. It's probably not the best translation ever, and I will probably improve it every now and then. It is not only a translation from one language to another, it's a translation from one purpose to another - the usual reader of this blog probably is rather descriptivist rather than prescriptivist, and now, instead of illustrating that an argument is wrong, I want to illustrate a way of thinking about arguments more generally.

I will focus on a particular argument here and debunk it thoroughly. Many descriptivists would probably just shrug it off with a statement like "well, yeah, but language doesn't have to be logical" or something like that. My aim is to illustrate that the argument we are faced with is itself not logical despite having the trappings of a logical argument, and that it can be debunked using logic. Language is logical in some way, it's just that the rules it lives under are rather more complex than those of Logic 101 homework.

The Topic and the Bad Argument

A question that sometimes causes some disagreement in English is the case of the complement of the copula. In effect, is it "it is me" or "it is I"? To compare with a different language, in most varieties of Swedish, it is "det är jag" - i.e. "it is I". There is no controversy in Swedish whether this is correct in the standard language, and most dialects also follow this pattern. The situation in English is of course less clear-cut, although some prescriptivists might think it ought to be entirely clear-cut.

This of course leads to a fair share of disagreement in the anglosphere - grammar nazis can have a hard time accepting two permissible options, or even wrapping their minds around more complicated situations where multiple factors interact. In this situation, any argument - no matter how bad - becomes ammunition in the conflict.

One argument that I've found an anglophone prescriptivist voice in favour of the use of the nominative is that "is" equates, and therefore the subject and complement must be of the same case. Here, as a quote, is the complete chain of reasoning:

"Is" equates. Equals should be similar. What's on the left side of "is" is the subject. What's on the right side is therefore subjective case, not objective. Is that a simple enough rationale for you? Somebody else can give you historical background on how and when this perfectly logical rule came into being.

Clearly, logic is the first, best stop for an answer regarding questions such as this - and this sure looks like an argument from logic, doesn't it? It uses simple, undeniable premises and reaches a clear conclusion without any frills.

Yet, as it turns out this argument relies on a very flawed application of logic! It lacks several premises, and does not really reach the conclusion it pretends to reach. It is rather similar in some sense to the naturalistic fallacy: "things ought to be (represented linguistically) in a way that reflects their natural characteristics" is a mistake, and how much more so the notion that expressing similarity should have the elements be similar. "She resembles me" has 'me' in the accusative and 'she' in the nominative despite expressing similarity. Besides, no one has ever bridged the is-ought gap.

Clearly, the statement is fairly universal in its nature as well - it relies on such fundamental building blocks that it should be applicable in nearly any language, and on nearly any sentence linking two nouns by "is" or its corresponding words in other languages - "is" should, if it holds, always and only link nouns of the same case. Does it?

The car is John's.
Honey, I'm home.

I concede that the second example is somewhat iffy, but "home" in that use clearly is some sort of unmarked locative. Sure, you might say "home" is an adverb, but clearly that would also violate it - it's a noun acting adverbially linked to a personal pronoun by a verb that equates, or it's a personal pronoun being equated with an adverb. Both fail the premise that is equates.

If we take the rule as it was stated literally, it does also ban "The car is John's" though - the two nouns are not of the same case. Any objection to my objection literally signifies that the rule is incomplete (and any completion will at least in part violate its basic form quite strongly.)

We can further note that prepositions fundamentally are not significantly different from case marking; some linguistic literature doesn't even really distinguish, say "allative" from "on X". Yet,
"I am on it", "he was at Seagrave's", "she's in town", "Eric's on drugs", ...

are not blocked by this. Curious, isn't it. Of course, in languages that deal with locatives and such through case forms, these will also violate the "is-links-same-case" rule a lot of the time. 

Even if we reject the idea that prepositions are cases, we still find reason to reject "is" linking same-with-same:

He is with me right now. (Not "he is with I right now", which would literally fulfill the requirement.)

Now, one could object that I am misinterpreting something - "is [noun]" and "is [prepositional phrase]" are different uses of is! But this by itself only strengthens my argument - it shows that the claim '"Is" equates. Equals should be similar."' fundamentally fails. The objection to my argument basically concedes that "is equates except when it doesn't". The rule is way too simple, and hides many important facts about is and other copulas.

How about
woe is me
mir ist kalt ('me.dat is cold', "I am (feeling) cold")
Min Gud är mig en väldig borg ("my god is me a mighty castle" - "my god is a mighty castle for me")

Further, we can show that is is not a strict equality thus:
((is) ≡ (≡)(the house is red) ≡ (the house ≡ red)
(the house
≡ red) (the apple ≡ red) → (the house ≡ the apple)
:= the house is the apple.
In other words, if it's true that is equates, it follows that if the house is red, and the apple is red, the house is the apple. The point here is basically that you just can't translate 'is' into a simple logical operation. Doing so leads to bizarre conclusions, and therefore is has to be something that is different from ≡, in such a way that is can be used to express the idea that ≡ expresses, without being the same operation. This means any "syntactical" considerations that  ≡ carries may be ignored. Among these are the notion that the left and right side may be marked in different ways to show distinctions. Maybe differences in scope are marked by different cases.

Further, if is equates, the right and left side of is should be equivalent, in which case it should be acceptable to switch their order.
The house is red. / Red is the house.
Huset är rött / Rött är huset.

Both of these do work in Swedish and English, but "red" does not serve the same syntactical role as "the house" does, and thus they cannot be equivalent.
The house is red and stands by a beautiful lake.
?Red is the house and stands by a beautiful lake.

The house is red and the house stands by a beautiful lake.
*Red is the house and red stands by a beautiful lake.

We can further notice the following:

a) The captain is a woman.
b) A woman is the captain.
c) A woman the captain (indeed) is.
d) *The captain a woman (indeed) is.
 
Granted, some of these do sound like cheap lines in an amateur production of a story about 18th century pirates with a female at the helm. However, they are syntactically parsable with meaningful distinctions for most speakers of English: a) is unmarked or can convey some surprise, b) could convey some surprise or at the very least highlights that this is somewhat unusual, and c) affirms in some way (either positive or sarcastic) the fact.

These three acceptable utterances have pragmatic distinctions - the information structure of these three utterances are different. If "is" only equated, this would not make sense. Besides, if "is" only equated, d) should also be acceptable.

Empirical data:
1. Danish
In good danish, "it is me" ('det er meg') is acceptable.

2. Russian and Polish
Since I am describing two languages, this description is a sort of average of the two - not entirely accurate for either, but close enough for government work.

In Russian and Polish, noun complements of the copula often are in the instrumental. Adjectives more often are nominative, but in some contexts, they too seem to favour (or even require) the instrumetal. Both the Russian and Polish copula are cognates to the English be/is forms. (C.f. be ~ być, быть, is ~ jest, есть.)

Further, these languages sometimes have quirky case subjects that are not nominative, for instance the Russian 'to have to', надо. With надо, the 'subject' is in the dative - yet complements of надо быть do not agree with the subject in case. Clearly быть does not "strictly" equate.


3. Finnish
In Finnish, the copula can take either nominative or partitive - the subject usually being nominative.

Sometimes, other cases are uses - e.g. the "into a role" case, the translative:
hän ei ole lääkäriksi
he/she is not "into the role of doctor" > (s)he isn't suited to be a doctor

The subject in this construction could also be in the ablative:

hänestä ei ole lääkäriksi
from him/her is not into role of doctor -> (s)he isn't "doctor material".

Much like Russian, Finnish also has verbs whose subjects are not nominative:
häne-n on pakko olla hullu
his/hers has to be mad
(s)he has to be mad
not
*häne-n on pakko olla hullu-n
his/hers has to be mad's
When stating age in years, the opposite holds: the numeral is in the genitive, but the subject in the nominative: hän on viiden - (s)he is five.

Furthermore, the Finnish copula can take partitive subjects when you make existential statements, and when specifying quantities it is mandatory:
heitä oli viisi
*he olivat viisi
*heitä oli viisiä
*heitä oli viittä
they.part were five.nom
No matter what kind of group is being quantified, the subject is partitive and the numeral is in the singular nominative - Finnish has plural nominatives and singular and plural partitives for every number so that gets a bit confusing.

Further, like many languages, Finnish does not have a separate verb for "have", but forms that meaning by using 'to be' and some kind of locative - in Finnish it's the allative, -llA, "by, at, on". "Tuomolla on kirkasta" - "by Tuomo is clear.part", "Tuomo has vodka" - linking a noun in the allative to a noun in the partitive in this particular instance!

4. Many languages in Africa
In Africa, many languages have copulas whose "complements" syntactically and formally actually are objects. (NB: in many of these languages, nouns do not have case, but ~case is conveyed by a verb marked that agrees in gender with the noun.)

5. Swedish
But wait a moment, I stated previously that Swedish takes the nominative? Well, even Swedish raises some objections to the "is equates" model! I will use English word-for-word translations to make it easier for non-Swedish readers to keep up. I am not making statements about English grammar in this section.

Reflexives - "I am myself" - "jag är mig själv" - is perfectly accepted by the standard grammars. "Jag är jag" is also possible, but has slightly different connotations. All reflexives in Swedish are accusative.

In the third person, though, "He wants to be he" means something entirely different from "he wants to be himself". "He wants to be he" requires the two "he" pronouns to have different referents, and signifies that the first wants to become the second. "He wants to be himself", " does not state that two different referents are one, it states that he wants to be (back to) his (usual) self. Thus, to express the notion that he wants to be his own self, the rules require one to use the accusative reflexive.

I have further tried finding rules for the construction "let me be me" in Swedish. In English, it seems the is equates rule actually "seems" to be applied here (or rather, most people probably go for "be me" anyway, and those who think is equates think that's why it should be me there. The underlying mechanisms are indistinguishable from the regular speaker in this case). I feel like "vara jag" (= be I) is more natural in this context, in which case the is equates principle fails in the somewhat unusual other direction: an accusative notional subject with a nominative complement!

6. English
I am not going to point to the actual fact that the overwhelming majority seem to be entirely okay with "it is me". The examples I want to attract attention to are syntactically a bit more complicated.
Would you want to be me?

I found very few examples (in fact none) of this with the complement in the nominative. I am pretty sure some instances of this must be the result of equates-believers - certainly someone who believes it should be "it is I" must have written a sentence with "would ... want to be [personal pronoun]". I may not have searched diligently enough, but this at least indicates that for most people, the intuitive case to use with the copula is the accusative even if they believe that they use the equating strategy.

One step further, we find

606 000g it is hard to be me
0g it is hard to be I

The numeral followed by a superscript g stands for number of google hits.

We could of course also look at the syntax of equating - it should require "it would be hard for him to be me" to become "it would be hard for him to be for me", since "for me" is the PP that is equated with the complement of the copula.

One nice thing to note about the above example is that either we posit some ethereal subject that has a hard time being me, or we posit that to be in it doesn't equate anything with me. Again, "is equates" fails.

My conclusion from these examples is that the natural case for complements in English indeed is the accusative. Using the nominative is a learned affectation that fails for almost everyone whenever the syntax has the tiniest complication. Luckily, most situations with complications involve a noun in the accusative, and so the "accusative is right" and "equal case is right" become indistinguishable in most of those situations. Luckily, a few constructions do exist that show that it's more likely that the "accusative is right" rule is more fundamental to English grammar.

An improved model
It is in fact easy to posit a model that accounts for the syntactical and semantic behavior of the copula without having to resort to the principle I set out to argue against. In addition, the principle I set out to argue against leads to bizarre conclusions, and does not fit with actual empirical evidence.

One of the roles of the copula is simply to signify that the subject is being described by an element that is syntactically subordinate to the copula. This element's case (and sometimes the case of the subject as well) can be controlled by the copula or by syntactic and semantic factors. Other particles and elements may also inform us of the function of the copula (e.g. "There ..." in English turning it into an existential quantifier). The copula can be used to tell of ownership ("is John's"), membership in a group ("is one of us", "is one of the best cooks", "is a freemason"), age ("on viiden"), quantity ("heitä on viisi"), adjectival properties ("the house is red"), location ("I'm inside", "honey, I'm home"), existence, and further serve as an auxiliary (future tense with imperfective infinitives in Russian, obligation with genitive subjects and participles in Finnish, ....)

The advantages of this model is that it's actually applicable to actual real existing languages. It won't fail as soon as you attempt to apply it to Finnish or Russian or honest-to-God English. It also is applicable even to a language that actually completely maintains case equality, or to a language that always requires the complement to be nominative.

Granted, unlike the "is equates" model, the only prediction we can get out of this model is that 'languages will vary'. But, if we assume an empirical approach, it's better if our model makes general but true predictions than if it fails arguably even for the language it attempts to describe, but certainly for a multitude of other languages as well. "Is equates" was an acceptable hypothesis, but once it's proven wrong, we shouldn't use it anymore except for the conclusions we could reach by investigating it.

Conclusion
We can see that the argument that was presented as a logical cause of a grammatical rule is based in a flawed understanding of both logic and language. This is not unusual.

In Swedish, it is "it is I", but not because it's logical - it is thus because that's how Swedish works. In English, it's unstable. Flawed logical arguments should not be used to tar one of the two alternatives as bad or illogical - doing that would actually be illogical. The linguistic intuitions of the speaker community should be the basis for this. It would be a sad day indeed if crappy logic were given a victory just because it looks scientific. It would basically be conceding the victory to pseudoscience and pseudologic.

Sunday, October 4, 2020

Ŋʒädär: Pondering Participles

These are just some early musings on a part of Ŋʒädär grammar. It is not very structured, but is provided as a sort of glimpse into how it comes along.

I have recently been tinkering with the participle system of Ŋʒädär. To go in a bit of an unusual direction I figured I should use a system of pre- and postposited particles to form them. This would give some nice advantages with regards to coming up with grammar: bracketing strategies, ('dance- and singing', "see -en and -ing"), potentially interesting grammaticalization paths (with the same particles having other uses in other parts of the language), and just a system that generates a lot of text in a grammar.

I want a somewhat "suboptimal system", i.e. one where a subset of possible combinations do not lend themselves to any very smooth construction but require full or near-full forms.

One strategy for reducing the optimality could be having some of the forms that are in complementary distribution have the opposite bracketing type: maybe present participles have postpositions and past participles have prepositions:

en eat -> eaten
eat ing -> eating

Maybe for a few quirky verbs, 'en VERB ing' has a special meaning. 

This far, I think the Ŋʒädär participle system will at its core have two tenses x two voices, but the voices will be integrated with the animacy hierarchy in a way that doesn't make it "passive vs active" but rather something like ...:
intransitives take a low marker with low animacy nouns and a high marker with high animacy nouns.

With transitive verbs
low animacy nouns x low animacy marker: active / reflexive
low animacy nouns x high animacy marker: passive
high animacy nouns x higher animacy marker: passive
high animacy nouns x high animacy marker: reflexive
high animacy nouns x low animacy marker: active

Historically, a preposited particle might be a conjunction and a (dummy?) pronoun or even a noun of some sort. A postposited particle might originate with an auxiliary verb or a postposition. These differences in origin may be reflected in different available markings - TAM, etc. One further type of origin for participle particles could be discourse particles.

Some modal distinctions that are even absent in the finite verb might pop up in the participles, but I think the grammaticalization path for those modalities needs to have a reach that would not have hit the finite verbs.