Metrika članka

  • citati u SCindeksu: 0
  • citati u CrossRef-u:0
  • citati u Google Scholaru:[=>]
  • posete u poslednjih 30 dana:9
  • preuzimanja u poslednjih 30 dana:8
članak: 1 od 1  
Telfor Journal
2019, vol. 11, br. 2, str. 102-107
jezik rada: engleski
vrsta rada: neklasifikovan
doi:10.5937/telfor1902102P


Using the random components of the jitter of speech pitch period to assess the state of the user of social-cyber-physical system
(naslov ne postoji na srpskom)
aSouthern Federal University, Institute of Computer Technologies and Information Security, Rostov-onDon, Russia
bSt. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS), St. Petersburg, Russia
cSt. Petersburg State University of Aerospace Instrumentation (SUAI), St. Petersburg, Russia

e-adresa: epakulova@sfedu.ru, vatamaniuk.i.v@gmail.com, visharmail@gmail.com, iakovlev.r@mail.ru, nosovm@mail.ru

Sažetak

(ne postoji na srpskom)
Socio-cyber-physical systems are focused on perceptual interaction with users and involve analyzing and translating his or her physiological and psycho-emotional state. An actual scientific problem is to determine the latter basing on the user's speech signal. In particular, it can be solved relying on investigating the jitter of the speech pitch period. In this paper we propose an algorithm that allows one to improve the noise immunity of determining the pitch period of speech signal and a method of jitter determination based on averaging the change of the pitch period relatively to the current value. We also propose an algorithm for separating periodic and random pitch jitter based on using the discrete Fourier transform on the sequence of the pitch periods with the presence of unknown values in the unvoiced speech frames. Simulation shows that the proposed approach of filling the unknown values of pitch period has better results compared to the existing methods based on interpolation of the nearest known values.

Ključne reči

Reference

Alzamendi, G.A., Schlotthauer, G. (2017) Describing Voice Period Variability by means of Time Series Structural Analysis. u: 10th International Workshop: Models and Analysis of Vocal Emissions for Biomedical Applications, Firence, Italy, Proceedings, 11-14
Basov, O.O., Shalaginov, V.A., Ofitserov, A.I., Bogdanov, S.P., Zatsepin, A.V. (2011) Method of dividing jitter of period of fundamental tone of speech signal. Russian Federation Patent No 2419166, May 20
Basov, O., Ronzhin, A., Budkov, V. (2015) Optimization of Pitch Tracking and Quantization. u: International Conference on Speech and Computer, Springer International Publishing, 317-324
Basov, O., Basova, A., Nosov, M. (2014) Human Resources Management in Conditions of Operators' Psychophysiological State Changes. u: International Conference on Speech and Computer, Springer International Publishing, 259-267
Basov, O.O., Nosov, M.V., Shalaginov, V.A. (2014) Pitch-jitter analysis of the speech signal. SPIIRAS Proceedings, 1(32): 27-27
Bellman, R.E., Dreyfus, S.E. (2015) Applied Dynamic Programming. Princeton University Press
Chu, W.C. (2004) Speech coding algorithms: Foundation and evolution of standardized coders. New Jersey: John Wiley & Sons
Dong, L. (2011) Time Series Analysis of Jitter in Sustained Vowels. ICPhS, 603-606
Draving, S.D. (2005) Method and apparatus for decomposing signal jitter using multiple acquisition. U.S. Patent 6 898 535, May 24
Guenther, M.L. (2007) Method for decomposing timing jitter on arbitrary serial data sequences. U.S. Patent 7 254 168, Aug. 7
Huang, X., Acero, A., Hon, H.W. (2001) Spoken language processing: A guide to theory, algorithm and system development. Englewood Cliffs, NJ: Prentice Hall
Jin, Q., Li, C., Chen, S., Wu, H. (2015) Speech emotion recognition with acoustic and lexical features. u: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 4749-4753
Long, H., Guo, Z., Wu, X., Hu, B., Liu, Z., Cai, H. (2017) Detecting depression in speech: Comparison and combination between different speech types. u: 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), IEEE, 1052-1058
National Committee for Information Technology Standardization (NCITS) (2003) Fiber Channel Methodologies for Jitter and Signal Quality Specification
Nosov, M.V., Basov, O.O., Shalaginov, V.A. (2014) Study Jitter Characteristics of the Pitch Period of the Speech Signals. SPIIRAS Proceedings, 1(32): 27-44
Schoentgen, J., Aichinger, P. (2019) Analysis and Synthesis of Vocal Flutter and Vocal Jitter. u: Interspeech 2019, ISCA, 2518-2522
Schoentgen, J., de Guchteneere, R. (1997) Predictable and random components of jitter. Speech Communication, 21(4): 255-272
Schoentgen, J., de Guchteneere, R. (1995) Time series analysis of jitter. Journal of Phonetics, 23(1-2): 189-201
Silva, D.G., Oliveira, L.C., Andrea, M. (2009) Jitter Estimation Algorithms for Detection of Pathological Voices. EURASIP Journal on Advances in Signal Processing, 2009(1): 9-9
Tabatabaei, S. (2010) Jitter spectrum analysis using random sampling (rs). U.S. Patent 7 844 022, Nov. 30
Ward, B.A., Tan, K., Guenther, M.L. (2004) Apparatus and method for spectrum analysis-based serial data jitter measurement. U.S. Patent 6 832 172, Dec. 14
Wilstrup, J.B., Petrich, D.M. (2002) Method and apparatus for jitter analysis. U.S. Patent 6 356 850, Mar. 12