For the past six years, I have been working in automatic speech recognition, speech synthesis, and conversion. In Erlangen, the key focus is applying speech engineering technologies in the biomedical domain.
Current projects
Recent publications
2024
Schieber, H., Demir, K.C., Kleinbeck, C., Yang, S.H., & Roth, D. (2024). Indoor Synthetic Data Generation: A Systematic Review. Computer Vision and Image Understanding, 240. https://doi.org/10.1016/j.cviu.2023.103907
2023
Demir, K.C., Schieber, H., Weise, T., May, M., Maier, A., & Yang, S.H. (2023). Deep Learning in Surgical Workflow Analysis: A Review of Phase and Step Recognition. IEEE Journal of Biomedical and Health Informatics, 1-14. https://doi.org/10.1109/JBHI.2023.3311628
Oppelt, M.P., Foltyn, A., Deuschel, J., Lang, N.R., Holzer, N., Eskofier, B., & Yang, S.H. (2023). ADABase: A Multimodal Dataset for Cognitive Load Estimation. Sensors, 23(1). https://doi.org/10.3390/s23010340
Tayebi Arasteh, S., Rios-Urrego, C.D., Nöth, E., Maier, A., Yang, S.H., Rusz, J., & Orozco-Arroyave, J.R. (2023). Federated learning for secure development of AI models for Parkinson’s disease detection using speech from different languages. In Proceedings of INTERSPEECH 2023 (pp. 5). Dublin, IE.
Tayebi Arasteh, S., Weise, T., Schuster, M., Nöth, E., Maier, A., & Yang, S.H. (2023). The effect of speech pathology on automatic speaker verification: a large-scale study. Scientific Reports, 13, 20476. https://doi.org/10.1038/s41598-023-47711-7
Weise, T., Maier, A., Demir, K.C., Pérez Toro, P.A., Arias Vergara, T., Heismann, B., ... Yang, S.H. (2023). Impact of Including Pathological Speech in Pre-training on Pathology Detection. In Kamil Ekštein, František Pártl, Miloslav Konopík (Eds.), Text, Speech, and Dialogue (pp. 141-153). Pilsen, CZ: Cham: Springer.
Weise, T., Maier, A., Demir, K.C., Pérez Toro, P.A., Arias Vergara, T., Heismann, B., ... Yang, S.H. (2023). Impact of Including Pathological Speech in Pre-training on Pathology Detection. Springer Science and Business Media Deutschland GmbH.
Yang, S.H., Demir, K.C., Weise, T., Schmid, A., May, M., & Maier, A. (2023). PoCaPNet: A Novel Approach for Surgical Phase Recognition Using Speech and X-Ray Images. In Proceedings of the International Conference Interspeech 2023 (pp. 5). Dublin, IE.
Yang, S.H., Pérez Toro, P.A., Weise, T., Hoffmann, B., Demir, K.C., Nöth, E., ... Maier, A. (2023). Multi-Modal Biomarker Extraction Framework for Therapy Monitoring of Social Anxiety and Depression Using Audio and Video. In International Conference on Machine Learning (Workshop on Machine Learning for Multimodal Healthcare) (pp. 9). Hawaii Convention Center, 1801 Kalākaua Ave, Honolulu, HI 96815, United States, US.
Yang, S.H., Weise, T., & Demir, K.C. (2023). Impact of Including Pathological Speech in Pre-Training on Pathology Detection. In Proceedings of the Text, Speech, and Dialogue. Satellite event of Interspeech 2023 (pp. 12). Pilsen, CZ.
2022
Demir, K.C., May, M., Schmid, A., Uder, M., Breininger, K., Weise, T., ... Yang, S.H. (2022). PoCaP Corpus: A Multimodal Dataset for Smart Operating Room Speech Assistant Using Interventional Radiology Workflow Analysis. In Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 464-475). Brno, CZ: Springer Science and Business Media Deutschland GmbH.
Hernandez, A., Klumpp, P., Das, B.K., Maier, A., & Yang, S.H. (2022). Autoblog 2021: The Importance of Language Models for Spontaneous Lecture Speech. In Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (Eds.), Text, Speech, and Dialogue 25th International Conference, TSD 2022, Brno, Czech Republic, September 6–9, 2022, Proceedings (pp. 291-300). Brno, Czech Republic, CZ: Springer Nature Switzerland AG: Springer Cham.
Hernandez, A., Pérez Toro, P.A., Nöth, E., Orozco Arroyave, J.R., Maier, A., & Yang, S.H. (2022). Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition. In Proceedings of Interspeech 2022 (pp. 51-55). Seoul, KR.
Maier, A., Köstler, H., Heisig, M., Krauss, P., & Yang, S.H. (2022). Known operator learning and hybrid machine learning in medical imaging - A review of the past, the present, and the future. Progress in Biomedical Engineering, 4(2). https://doi.org/10.1088/2516-1091/ac5b13
Maier, A., Yang, S.H., Maleki, F., Muthukrishnan, N., & Forghani, R. (2022). Do Proprietary Algorithms Still Offer Protection of Intellectual Property in the Age of Machine Learning?: A Case Study Using Dual Energy CT Data. In Klaus Maier-Hein, Thomas M. Deserno, Heinz Handels, Andreas Maier, Christoph Palm, Thomas Tolxdorff (Eds.), Informatik aktuell (pp. 345-350). Heidelberg, DEU: Springer Science and Business Media Deutschland GmbH.
Sindel, A., Hernandez, A., Yang, S.H., Christlein, V., & Maier, A. (2022). SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks. In Proceedings of the OAGM Workshop 2021. Computer Vision and Pattern Analysis Across Domains. Verlag der Technischen Universität Graz.
Weise, T., Maier, A., Nöth, E., Heismann, B., Schuster, M., & Yang, S.H. (2022). Disentangled Latent Speech Representation for Automatic Pathological Intelligibility Assessment. In Proceedings of INTERSPEECH 2022. Songdo, KR.
2021
2020
Related Research Fields
Contact: