For Continuing Debate

				Contact \| Subscribe \| Site Map
			HOME \| CONTENTS \| CONTRIBUTE \| ARCHIVE

Current Editorial

	Reflecting on the Hamas-Israel War: This issue of Phalanx, which has come after a long while because of the COVID Pandemic, is focused largely on AI but the Hamas-Israeli war has pushed AI to the periphery in the international attention. The editorial therefore looks at the war, reflects on what Hamas hoped to gain from it and the future of Palestine and Israel. Read

Review
	Film:
	Portraying Science in Cinema Oppenheimer (Christopher Nolan, 2023) and Dau (Ilya Khrzhanovsky, 2019) Read

Articles list of Issue

Home > Archive > Article: Hans Verghese Mathews

Borges and Machine Translation:
'Blue Tigers'

Translation by Hans Verghese Mathews

This is a translation of a story ‘Blue Tigers’ by Jorge Luis Borges. As hitherto done with previous translations in Phalanx the actual translation is provided through a link to a PDF. But before the actual story is made available this exercise begins with a note from the translator reflecting on the experience of translating Borges online using the different tools available although the actual translation is done in the normal old-fashioned way.

Apology
A story by Borges in an issue given over to AI seems out of place ―― more so that it should be Blue Tigers, Tigres Azules, one of the very last and more strange among the ficciones ―― and that the editors should have indulged themselves so will seem only laziness, moreover, to aficionados; who, even if they think the phrase ‘artificial intelligence’ an oxymoron, could point to more pertinent writing in the oeuvre; to Borges’s divagations on the characteristica universalis projected by Leibniz, for instance, or to his singular Defense of the Kabbalah even.

Some excuse should be offered; or attempted at least; and an opening of sorts ― a chance chink, not a “window of opportunity” ― is provided by the ending of Tigres Azules: the englishing of its last sentence presents a peculiar obstacle, as it happens, to machine-translation out of Spanish into English. Defending such a claim would not be easy ―― to advance a comprehensive argument in any economical way is beyond me ―― and I shall only try to suggest why.

So that there will not be a riot of quote-marks I shall use the courier font from now on, rather than single-quotes, to refer to words or phrases within sentences:
to “mention” rather than “use” them, as philosophers of language used to say. But I shall keep on using double-quotes in the usual ways: either to quote verbatim or, else, to hedge assertion.

No oí los pasos del mendigo ciego ni lo vi perderse en el alba: our ficcion closes just so, upon a sentence set off by itself. Our translator has hazarded what an algorithm is likely to discard ―― I did not hear the steps of the blind beggar, nor see him lose himself in the dawn ―― because perderse is not functioning as a reflexive pronominal verb in the sentence: which a transcribing algorithm would segment and parse first, assigning “parts-of-speech” to words and phrases, before proceeding. I apologise for the terminology of the grammarians; but resorting so is the shortest way to go, and, besides, the Web will tell curious readers enough about pronominal verbs in Spanish, and in French; and in Latin too, should they be curious enough.

The empirical man will ask if one could ever see someone lose himself any- where: but he will have better things to do, doubtless, than read Borges. There are a good many sites offering automatic translation on the Web, and I thought it prudent to test my guess by passing our sentence to their algorithms. I tried seven sites; and (with the names of the sites appended) here are the English sentences that were returned:

A     No oí | los pasos | del mendigo ciego | ni lo vi perderse | en el alba


A1 I did not hear | the footsteps | of the blind beggar | nor did I see him get lost | in the dawn

Google Translate

A2 I did not hear | the footsteps | of the blind beggar | nor did I see him lose himself | in the dawn

DeepL

A3 I did not hear | the footsteps | of the blind beggar | nor did I see him get lost | at dawn

Lingvanex

A4 I did not hear | the steps | of the blind beggar | nor did I see him get lost | at dawn

Systran

A5 I did not hear | the footsteps | of the blind beggar | or see him lose himself | in the dawn
Translate.com

A6 I did not hear | the footsteps | of the blind beggar | nor did I see him lost | at dawn

Reverso

A7 I didn't hear | the blind beggar's footsteps | or see him get lost | at dawn

PROMT

I have sectioned A in the way that all but two of the seven transcribing sentences seem to have segmented the original: A2 and A5 are the evident exceptions.

The algorithms of DeepL and Translate.com were not blind-sided by their parsers, it looks, in the way the others seem to have been; for while get lost may be a competent transcription of perderse by itself, paying no heed to context, and though at dawn for en el alba may be passable elsewhere, the rendering get lost at dawn dissociates alba from perderse distortingly here: in A the losing does not happen at dawn merely.

Translate.com employs “machine translation technology” and “artificial intelligence” the site says ―― distinguishing Iberian from Latin American Spanish and Standard American from English, reassuringly, in the provided options ―― and that or rather than the literal nor was put in for ni suggests that its algorithm is more than ordinarily alert to context and usage (if without conspicuous success just now); and it is tempting to say the same of the DeepL routine.

The output of PROMT was tweaked to sound conspicuously colloquial; and, as one might expect, the interface provides a “contexts” button activating an algorithm that, given a string of words, will retrieve actual samples of putatively cognate usage from its database. For perderse en it supplied the following sentence together with a translation:

B   En este mundo, es fácil perderse en los engaños y las tentaciones.

B1 In this world, it’s easy to lose oneself in all the deceptions and temptations.

The all interpolated in B1 is surprising (one could excise all the there without loss) but, all the same, the transcribing routine seems to have sectioned B in a natural way: linking en to perderse as much as to los engaños y las tentaciones it looks. One wonders why the PROMT algorithm did not do likewise in transcribing A: as the routines of both DeepL and Translate.com will now appear to have.

Noticing that the comma in B did no work, really, I thought to experiment a
little more by sending

C en este mundo es fácil perderse en los engaños y las tentaciones

to DeepL: expecting that its routine would have en qualify perderse once more, as I supposed it to have done in transcribing A with A2. But what it returned was the less than natural.

C1 in this world it is easy to get lost in deception and temptation.

alas: which indicated, to the contrary, that en had been assimilated into the noun-phrase los engaños y las tentaciones without reference to the verb perderse.
The next step was to try Translate.com with C in the hope that it would do better than DeepL: but

C2 in this world it is easy to get lost in deceptions and temptations.

was the disappointing result. A conscientious man would have repeated the exercise with the other sites used to translate A, but I did not have patience enough for that (not just then. But a few days after I did try both DeepL and Translate.com with the final two sentences of Tigres Azules, to see what difference that might make to their transcriptions of the last one: there wasn’t any.)

So, leaving things just so, I shall try now to characterise the “peculiar obstacle” that, as I had asserted, machine-translation from Spanish to English must overcome; and the complication, very summarily, is that Spanish has much richer ways than English to lexically register the modulation of agency in daily doing and common experience.

Here are three suggestive examples of verb and cognate reflexive pronominal:

caer                                    caerse
to fall                                 to drop

negar                                   negarse
to deny                               to refuse

salir                                     salirse
to leave                               to escape

To pursue the matter one would have to consider generally the use of such and like “deponent” verbs in Spanish ― inherited from Latin, and preserved there more than in French maybe ― the complex semantics and pragmatics of which have been studied quite extensively (so a quick look online suggested) with a view to characterising how their morphology indexes agency medial to and varying between active doing and passive enduring. I shall venture, thereupon, that learning the use of deponent verbs requires a body: somatic proprioception would condition success ―― “inner sensing” must weigh syntactic proximity properly ―― and the discriminations of inner bodily affect are precisely what a successful transcribing algorithm would have had to detect, somehow, sans body, in the disjuncta membra of the corpora it was trained upon. Readers acquainted with linguistics would have pronounced opinions on the possibilities here. The partisans of Lakoff, for instance, are likely to agree that mastering deponents requires a body ―― and would deny, maybe, that an algorithm could sufficiently detect how proprioception conditions utterance (in poetry parti- cularly) ―― but disciples of Chomsky might be more circumspect: even though the master has not been very complimentary about “natural language processing” by computer, as far as I know (indifferent to literature though he appears to be), and he has scoffed at the “generative AI” that has birthed the seeming wonder risibly named “ChatGPT”.

To say anything more without probing the processing algorithms themselves would be very foolish however ... and an enthusiast for AI would say that my tinkering has proved nothing at all ...
so I must leave the reader to Borges and his blue tigers now:
venturing this much more only, and improperly maybe, that the man who contrived those estranging fictions was a lyric poet fending, with words, within words, the perditions ― las perdiciónes ― of old age.

Hans Verghese Mathews

Complete Story: Blue Tigers

courtesy: https:Jorge_Luis_Borges_bibliography

Top