Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Victor Vvedensky, Kurchatov Institute Direct contact with technical devices using human speech implies ability of the computer to understand in some way the idea of the message. The people feel semantic similarity of words and can find the word with close or just opposite meaning. Can one find mathematical description of this ability? Transistors flop millions times faster then the neurons are spiking, available fast memory is huge in comparison with the working memory of the brain, the software can be sophisticated enough to beat top chess players, and still, computers fail to interpret vocal commands of humans. Special organization of language in the brain is the probable reason of it’s excellent performance. We start our study with the analysis of compatibility of different elements of language. Each verb can be attributed with the set of properties. The adverbs, which can be readily used together with this verb in many different contexts, reflect these properties. One can смело прыгать to jump bravely ловко прыгать to jump deftly возбужденно прыгать to jump excitedly 1 1 1 1 уничтожать 1 1 1 1 1 1 1 1 носить 1 1 1 1 1 1 0,5 1 расходов ать 1 1 1 0,5 1 1 1 1 1 1 1 1 0,5 0,5 0,5 1 приучать требов ать 1 1 1 в рать 1 1 1 0,5 1 1 призыв ать 1 0,5 0,5 1 1 1 1 1 1 успокаив ать 1 0,5 1 1 0,5 1 покрыв ать 1 0,5 0,5 1 0,5 1 нарушать 1 0,5 1 1 0,5 растягив ать 1 1 1 1 0,5 1 пробуждать 1 1 0,5 0,5 подбив ать 1 1 1 иметь 1 1 разбив ать 1 1 честв ов ать 1 0,5 плев ать 1 0,5 1 убыв ать 0,5 0,5 1 1 в ырабатыв ать 0,5 1 0,5 0,5 0,5 процежив ать 0,5 0,5 1 0,5 1 0,5 0,5 1 0,5 1 1 0,5 1 1 0,5 0,5 0,5 1 1 1 0,5 0,5 1 0,5 0,5 0,5 1 1 0,5 0,5 0,5 1 1 1 1 1 0,5 1 0,5 0,5 0,5 0,5 0,5 0,5 0,5 0,5 0,5 1 1 0,5 0,5 1 1 0,5 0,5 0,5 0,5 0,5 1 1 0,5 0,5 0,5 0,5 0,5 0,5 0,5 0,5 1 0,5 0,5 0,5 0,5 0,5 0,5 0,5 1 1 1 1 1 0,5 0,5 0,5 0,5 1 1 0,5 1 0,5 1 0,5 0,5 0,5 1 1 0,5 1 0,5 0,5 0,5 1 0,5 0,5 1 0,5 0,5 0,5 1 0,5 1 1 1 1 1 1 1 1 лишать 1 1 0,5 счастливо 1 1 смертельно 1 1 пристально укладыв ать 1 0,5 оживленно 0,5 1 безвольно 1 1 1 сердечно 1 1 болезненно 1 1 задумчиво 1 бить понятно следить развязно 1 отчётливо 1 скрытно 1 1 ненадежно 0,5 1 1 точно справедливо 1 1 1 верно мужественно 0,5 изменять преданно безнаказанно 1 0,5 тайно 0,5 незаметно 1 усердно 1 уверенно 1 часто медленно Compatibility values for representative sets of 25 verbs and 25 adverbs of Russian language 0,5 1 0,5 1 1 0,5 1 2D projections of 900 verbs in first 9 dimensions of the multidimensional space Each verb is presented as a dot. The numbers indicate the order of dimensions. Each 2dimensional plane shows distributions for the pair of dimensions with numbers indicated for rows and columns. The uppermost square represents projection on the plane in dimensions “one” and “two” or X and Y. The scale is normalized to unity Distributions of verbs and adverbs in first two dimensions of the multidimensional space 900 verbs and 600 adverbs of Russian language in first two dimensions. The distributions nearly mirror each other so that the most compatible verbs and adverbs are close, while incompatible pairs are on the far ends. Sets of verbs compatible with three adverbs are shown as hazel points. Green points indicate fuzzy cases. To cost, to fill over, to throw down, to continue patiently, industriously, motionlessly 1 1 1 1 0,5 1 1 1 1 1 0,5 1 1 0,5 1 1 1 0,5 1 1 1 0,5 1 1 1 1 0,5 1 1 1 1 1 1 0,5 1 желать жертвовать 1 1 0,5 1 дружить 1 блуждать 1 терзать 0,5 1 1 жалеть владеть хитрить губить 1 1 звучать 0,5 0,5 0,5 1 1 0,5 1 1 0,5 1 1 1 0,5 0,5 0,5 1 1 1 1 0,5 1 0,5 1 1 0,5 0,5 1 1 1 1 1 0,5 1 1 1 0,5 0,5 врать любить 1 1 0,5 1 1 1 1 1 тратить 1 1 1 крепить 1 пробовать плакать 1 1 1 1 1 делить по1 1 1 1 1 1 за1 1 1 1 0,5 1 про1 1 1 1 1 1 с-,со1 1 1 1 1 вы1 1 1 1 1 1 пере1 1 1 1 1 1 раз-,рас1 0,5 0,5 1 1 от1 1 1 1 1 на1 1 1 1 0,5 0,5 у0,5 0,5 1 1 1 до1 1 0,5 1 1 при1 1 1 1 1 вз-,воз-,вос- 1 под1 1 1 1 об1 1 1 из-,ис1 1 о1 1 1 1 в0,5 1 1 0,5 0,5 пред0,5 пре1 служить читать смотреть чистить падать жить давать Compatibility of prefixes with word roots can also be represented as a table for Russian verbs Distribution of Russian prefixes on the two-dimensional plane, calculated using compatibility table of word roots and prefixes. Since the number of prefixes is much smaller then the number of roots (~20 and ~4000) the distribution of prefixes is accurate enough. The opposite is not true. One needs another approach to analyze the space of roots. Small cortical patches with a certain function Brain studies, using special functional staining techniques, reveal confined areas performing definite tasks. These portions of the monkey cortex extract visual features with certain orientation. They are about 1 mm x 2 mm large. Rich internal connections can impart this cortical patch with properties reflected as additional dimensions. That is like blowing up the flat soap film making the bulb in the space with multiple dimensions D.H.Hubel, Eye, Brain, and Vision The principal central connections of hearing The principal central connections of speech The corticobulbar tract is a pathway connecting the cerebral cortex to the brainstem The muscles of the face, head and neck are controlled by the corticobulbar system Die Liste der 30.000 häufigsten Wortformen aus den Textkorpora des IDS, absteigend sortiert nach relativer Häufigkeit. Institut für Deutsche Sprache Zipf Law for words Exponential decay for word roots All the roots can be fit into 2D plot, where they densely fill the portion of the surface. Triangles - roots absent in Czech Triangles - Russian roots compatible with «На» prefix Russian roots with initial consonants «К» and «П» Space of qualities built using frequencies of occurrence of Russian adjectives and adverbs хорошо - well хороший - good прямо - directly прямой - stright сильно - strongly сильный - strong медленно - slowly медленный - slow полный - full полно - completely одновременно - simultaneously одновременный - simultaneous грязно - dirty грязный - dirty поспешно - in hurry поспешный - hasty толстый - thick Positions of different qualities, expressed in Russian words in respect to the positions of actions and entities. Speech generator should be in the upper right corner. 2D distribution of actions, built using frequencies of occurrence of continuous and completed forms of Russian verbs. понимать-понять to understand получать-получить to get кричать-крикнуть to cry вызывать-вызвать to call покупать-купить to buy носить-нести to carry выполнять-выполнить to fulfil выяснять-выяснить to clear up избавляться-избавиться to get rid Cortical Representation of Sign Language Cerebral Cortex, Vol. 11, No. 6, 506-512, June 2001 Sketch depicting interactions of cortical patches storing different elements of speech Аналогично можно построить вмещающее пространство для конечных согласных корней немецкого языка. А также многих других языков.