Phalanx Spacer Phalanx Logo Phalanx Slogan Phalanx Spacer
Contact | Subscribe | Site Map
  Phalanx Logo Phalanx Logo Phalanx Spacer
HOME | CONTENTS | CONTRIBUTE | ARCHIVE
 
         
Phalanx Spacer
Phalanx SpacerCurrent Editorial

Reflecting on the Hamas-Israel War:

This issue of Phalanx, which has come after a long while because of the COVID Pandemic, is focused largely on AI but the Hamas-Israeli war has pushed AI to the periphery in the international attention. The editorial therefore looks at the war, reflects on what Hamas hoped to gain from it and the future of Palestine and Israel.
Phalanx Spacer
Phalanx Read

Phalanx Spacer
Phalanx SpacerReview
Film:

Portraying Science in Cinema
Oppenheimer (Christopher Nolan, 2023) and Dau (Ilya Khrzhanovsky, 2019)
Phalanx Spacer
Phalanx Read

Phalanx Spacer
Phalanx SpacerArticles list of Issue
 
Home > Archive > Article: Hans Verghese Mathews
Phalanx Spacer
Borges and Machine Translation:
'Blue Tigers'
Phalanx Spacer

Translation by Hans Verghese Mathews

This is a translation of a story ‘Blue Tigers’ by Jorge Luis Borges. As hitherto done with previous translations in Phalanx the actual translation is provided through a link to a PDF. But before the actual story is made available this exercise begins with a note from the translator reflecting on the experience of translating Borges online using the different tools available although the actual translation is done in the normal old-fashioned way.


Apology
A story by Borges in an issue given over to AI seems out of place ―― more so that it should be Blue Tigers, Tigres Azules, one of the very last and more strange among the ficciones  ―― and that the editors should have indulged themselves so will seem only laziness, moreover, to aficionados; who, even if they think the phrase ‘artificial intelligence’ an oxymoron, could point to more pertinent writing in the oeuvre; to Borges’s divagations on the characteristica universalis projected by Leibniz, for instance, or to his singular Defense of the Kabbalah even.

Some excuse should be offered; or attempted at least; and an opening of sorts  ―  a chance chink, not a “window of opportunity” ―  is provided by the ending of Tigres Azules: the englishing of its last sentence presents a peculiar obstacle, as it happens, to machine-translation out of Spanish into English. Defending such a claim would not be easy ―― to advance a comprehensive argument in any economical way is beyond me  ―― and I shall only try to suggest why.

So that there will not be a riot of quote-marks I shall use the courier font from now on, rather than single-quotes, to refer to words or phrases within sentences:
to “mention” rather than “use” them, as philosophers of language used to say. But I shall keep on using double-quotes in the usual ways: either to quote verbatim or, else, to hedge assertion.

No oí los pasos del mendigo ciego ni lo vi perderse en el alba: our ficcion closes just so, upon a sentence set off by itself. Our translator has hazarded what an algorithm is likely to discard ―― I did not hear the steps of the blind beggar, nor see him lose himself in the dawn ―― because  perderse  is not functioning as a reflexive pronominal verb in the sentence: which a transcribing algorithm would segment and parse first, assigning “parts-of-speech” to words and phrases, before proceeding. I apologise for the terminology of the grammarians; but resorting so is the shortest way to go, and, besides, the Web will tell curious readers enough about pronominal verbs in Spanish, and in French; and in Latin too, should they be curious enough.

The empirical man will ask if one could ever see someone lose himself any- where: but he will have better things to do, doubtless, than read Borges. There are a good many sites offering automatic translation on the Web, and I thought it prudent to test my guess by passing our sentence to their algorithms. I tried seven sites; and (with the names of the sites appended) here are the English sentences that were returned:

A     No oí  |  los pasos  |  del mendigo ciego  |  ni lo vi perderse  |  en el alba

                                                                                                                                                            
A1   I did not hear  |  the footsteps  |  of the blind beggar  |  nor did I see him get lost  |  in the dawn

Google Translate
                                      
A2   I did not hear  |  the footsteps  |  of the blind beggar  |  nor did I see him lose himself  |  in the dawn

DeepL
                       
A3   I did not hear  |  the footsteps  |  of the blind beggar  |  nor did I see him get lost  |  at dawn 
                    
Lingvanex
                       
A4   I did not hear  |  the steps  |  of the blind beggar  |  nor did I see him get lost  |  at dawn       
       
Systran

                       
A5   I did not hear  |  the footsteps  |  of the blind beggar  |  or see him lose himself  |  in the dawn  
Translate.com
                       
A6   I did not hear  |  the footsteps  |  of the blind beggar  |  nor did I see him lost  |  at dawn 
       
Reverso
                       
A7   I didn't hear  |  the blind beggar's footsteps  |  or see him get lost  |  at dawn
       
PROMT
                                            
I have sectioned  A  in the way that all but two of the seven transcribing sentences seem to have segmented the original: A2  and  A5  are the evident exceptions.

The algorithms of DeepL and Translate.com were not blind-sided by their parsers, it looks, in the way the others seem to have been; for while get lost may be a competent transcription of  perderse  by itself, paying no heed to context, and though  at dawn  for  en el alba  may be passable elsewhere, the rendering   get lost at dawn  dissociates  alba  from  perderse distortingly here: in A   the losing does not happen at dawn merely.
                       
Translate.com employs “machine translation technology” and “artificial intelligence” the site says ―― distinguishing Iberian from Latin American Spanish and Standard American from English, reassuringly, in the provided options ―― and that  or  rather than the literal  nor  was put in for ni suggests that its algorithm is more than ordinarily alert to context and usage (if without conspicuous success just now); and it is tempting to say the same of the DeepL routine.

The output of PROMT was tweaked to sound conspicuously colloquial; and, as one might expect, the interface provides a “contexts” button activating an algorithm that, given a string of words, will retrieve actual samples of putatively cognate usage from its database. For  perderse en  it supplied the following sentence together with a translation:
                                               
B    En este mundo, es fácil perderse en los engaños y las tentaciones.
                                                       
B1  In this world, it’s easy to lose oneself in all the deceptions and temptations.
                                           
The  all  interpolated in  B1  is surprising (one could excise  all the there without loss) but, all the same, the transcribing routine seems to have sectioned  B  in a natural way: linking  en  to  perderse  as much as to  los engaños y las tentaciones it looks. One wonders why the PROMT algorithm did not do likewise in transcribing A:  as the routines of both DeepL and Translate.com will now appear to have.

Noticing that the comma in B did no work, really, I thought to experiment a
little more by sending
                        
C   en este mundo es fácil perderse en los engaños y las tentaciones
                         
to DeepL: expecting that its routine would have  en  qualify  perderse  once more, as I supposed it to have done in transcribing  A  with  A2. But what it returned was the less than natural.
                        
C1  in this world it is easy to get lost in deception and temptation.
                        
alas: which indicated, to the contrary, that  en  had been assimilated into the noun-phrase  los engaños y las tentaciones  without reference to the verb perderse.
The next step was to try Translate.com with  C  in the hope that it would do better than DeepL: but
                                 
C2  in this world it is easy to get lost in deceptions and temptations.
                                 
was the disappointing result. A conscientious man would have repeated the exercise with the other sites used to translate A, but I did not have patience enough for that (not just then. But a few days after I did try both DeepL and Translate.com with the final two sentences of Tigres Azules, to see what difference that might make to their transcriptions of the last one: there wasn’t any.)
                                                                    
So, leaving things just so, I shall try now to characterise the “peculiar obstacle” that, as I had asserted, machine-translation from Spanish to English must overcome; and the complication, very summarily, is that Spanish has much richer ways than English to lexically register the modulation of agency in daily doing and common experience.

Here are three suggestive examples of verb and cognate reflexive pronominal:
                                                                       
caer                                    caerse
to fall                                 to drop
                         
negar                                   negarse
to deny                               to refuse
                           
salir                                     salirse
to leave                               to escape
 
To pursue the matter one would have to consider generally the use of such and like “deponent” verbs in Spanish ― inherited from Latin, and preserved there more than in French maybe ―  the complex semantics and pragmatics of which have been studied quite extensively (so a quick look online suggested) with a view to characterising how their morphology indexes agency medial to and varying between active doing and passive enduring. I shall venture, thereupon, that learning the use of deponent verbs requires a body: somatic proprioception would condition success ―― “inner sensing” must weigh syntactic proximity properly ―― and the discriminations of inner bodily affect are precisely what a successful transcribing algorithm would have had to detect, somehow, sans body, in the disjuncta membra of the corpora it was trained upon. Readers acquainted with linguistics would have pronounced opinions on the possibilities here. The partisans of Lakoff, for instance, are likely to agree that mastering deponents requires a body ―― and would deny, maybe, that an algorithm could sufficiently detect how proprioception conditions utterance (in poetry parti- cularly) ―― but disciples of Chomsky might be more circumspect: even though the master has not been very complimentary about “natural language processing” by computer,  as far as I know (indifferent to literature though he appears to be), and he has scoffed at the “generative AI” that has birthed the seeming wonder risibly named “ChatGPT”.
           
To say anything more without probing the processing algorithms themselves would be very foolish however ... and an enthusiast for AI would say that my tinkering has proved nothing at all ...  
so I must leave the reader to Borges and his blue tigers now:
venturing this much more only, and improperly maybe, that the man who contrived those estranging fictions was a lyric poet fending, with words, within words, the perditions ― las perdiciónes ― of old age.

Hans Verghese Mathews




Phalanx Spacer

 






Complete Story: Blue Tigers
pdf



courtesy: https:Jorge_Luis_Borges_bibliography

Top
Phalanx Spacer
Phalanx Spacer
Home | Editor's Desk | Open Page | Content | Contribute | Archive | Manifesto | People | Contact | Subscribe | Site Map | Privacy policy | Legal
Phalanx Spacer
© 2016 PHALANX. All rights reserved | it's an El Remo Creation
Phalanx Spacer
Phalanx Spacer