چکیده To protect privacy and meet legal regulations, federated learning (FL) has gained significant attention for training speech-to-text (S2T) systems, including automatic speech recognition (ASR) and speech translation (ST). However, the commonly used FL approach (i.e., \textsc{FedAvg}) in S2T tasks typically suffers from extensive communication overhead due to multi-round interactions based on the whole model and performance degradation caused by data heterogeneity among clients.To address these issues, we propose a personalized federated S2T framework that introduces \textsc{FedLoRA}, a lightweight LoRA module for client-side tuning and interaction with the server to minimize communication overhead, and \textsc{FedMem}, a global model equipped with a $k$-nearest-neighbor ($k$NN) classifier that captures client-specific distributional shifts to achieve personalization and overcome data heterogeneity. Extensive experiments based on Conformer and Whisper backbone models on CoVoST and GigaSpeech benchmarks show that our approach significantly reduces the communication overhead on all S2T tasks and effectively personalizes the global model to overcome data heterogeneity.

چکیده به فارسی (ترجمه ماشینی) برای محافظت از حریم خصوصی و رعایت مقررات قانونی ، یادگیری فدراسیون (FL) برای آموزش سیستم های گفتار به متن (S2T) ، از جمله تشخیص خودکار گفتار (ASR) و ترجمه گفتار (ST) توجه قابل توجهی را به خود جلب کرده است.با این حال ، رویکرد FL متداول (به عنوان مثال ، \ textsc {fedavg}) در کارهای S2T به طور معمول از سربار ارتباطات گسترده به دلیل تعامل چند دور بر اساس کل مدل و تخریب عملکرد ناشی از ناهمگونی داده در بین مشتریان رنج می برد. برای پرداختن به این موضوعات، ما یک چارچوب S2T فدراسیون شخصی را پیشنهاد می کنیم که \ textsc {fedlora} ، یک ماژول لورا سبک وزن برای تنظیم سمت مشتری و تعامل با سرور برای به حداقل رساندن سربار ارتباطات ، و \ textsc {fedmem} ، یک مدل جهانی مجهز به $ k است.طبقه بندی کننده $-Nearest-Deighbor ($ k $ nn) که برای دستیابی به شخصی سازی و غلبه بر ناهمگونی داده ها ، تغییرات توزیع خاص مشتری را ضبط می کند.آزمایش های گسترده ای بر اساس مدل های ستون فقرات Conformer و Whisper بر روی معیارهای گوسفند و gigaspeech نشان می دهد که رویکرد ما به طور قابل توجهی سربار ارتباطات را در تمام کارهای S2T کاهش می دهد و به طور مؤثر مدل جهانی را برای غلبه بر ناهمگونی داده ها شخصی می کند.

زبان مقاله انگلیسی

عنوان مقاله به انگلیسی Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks

عنوان مقاله به فارسی یادگیری فدرال شخصی شده ارتباطی کارآمد برای تسک های گفتار به متن

فرمت مقاله PDF

نویسندگان Yichao Du, Zhirui Zhang, Linan Yue, Xu Huang, Yuqing Zhang, Tong Xu, Linli Xu, Enhong Chen

مشاهده بیشتر

نظرات کاربران

هیچ نظری ثبت نشده است.

کالاهای مشابه

جدیدترین‌های محتوای آموزشی فدرال

محتوای آموزشی مقاله یادگیری فدرال شخصی شده ارتباطی کارآمد برای ت...

از ۲۵,۰۰۰

محتوای آموزشی مقاله Feddrivescore: رفتار محرک امتیاز دهی فدرال ب...

از ۲۵,۰۰۰

محتوای آموزشی ترجمه مقاله یک طرح یادگیری فدرال مبتنی بر شیب مشتر...

از ۲۴۰,۰۰۰

دانلود کتاب Central Banking Systems Compared: The ECB, The Pre-Euro Bundesbank and the Federal Reserve System

از ۱۹,۵۰۰

محتوای آموزشی دانلود کتاب Federal Agents: The Growth of Federal Law Enforcement in America

از ۱۹,۵۰۰

$دانلود کتاب The Fourth Branch: The Federal Reserve\\'s Unlikely Rise to Power and Influence$

دانلود کتاب The Fourth Branch: The Federal Reserve\\'s Unlikely Rise to Power and Influence

از ۱۹,۵۰۰

دانلود کتاب Reinventing Rationality: The Role of Regulatory Analysis in the Federal Bureaucracy

از ۱۹,۵۰۰

دانلود کتاب Federal Accounting Handbook: Policies, Standards, Procedures, Practices

از ۱۹,۵۰۰

دانلود کتاب Aggressive Nationalism: McCulloch v. Maryland and the Foundation of Federal Authority in the Young Republic

از ۱۹,۵۰۰

دانلود کتاب Privatizing Fannie Mae, Freddie Mac and the Federal Home Loan Banks: Why and How

از ۱۹,۵۰۰

دانلود کتاب McCulloch V. Maryland: Implied Powers of the Federal Government (Great Supreme Court Decisions)

از ۱۹,۵۰۰

دانلود کتاب From Cotton Belt to Sunbelt: Federal Policy, Economic Development, and the Transformation of the South, 1938-1980

از ۱۹,۵۰۰

دانلود کتاب Creek Paths and Federal Roads: Indians, Settlers, and Slaves and the Making of the American South

از ۱۹,۵۰۰

دانلود کتاب From Good Will to Civil Rights: Transforming Federal Disability Policy; Second Edition (Health, Society, and Policy)

از ۱۹,۵۰۰

دانلود کتاب Committee Decisions on Monetary Policy: Evidence from Historical Records of the Federal Open Market Committee

از ۱۹,۵۰۰

محتوای آموزشی دانلود کتاب E-Government and Web Directory: U.S. Federal Government Online

از ۱۹,۵۰۰

$دانلود کتاب Observations on the President\\'s Fiscal Year 2003 Federal Science and Technology Budget$

دانلود کتاب Observations on the President\\'s Fiscal Year 2003 Federal Science and Technology Budget

از ۱۹,۵۰۰

محتوای آموزشی دانلود کتاب The Federal Reserve: A New History – فدرال رزرو: یک تاریخ جدید

از ۲۴,۰۰۰

دانلود کتاب Federated Learning: Fundamentals and Advances – یادگیری فدرال: مبانی و پیشرفت ها

از ۲۴,۰۰۰

دانلود کتاب Probing the Bureaucratic Mind: About Canadian Federal Executives – بررسی ذهن بوروکراتیک: درباره مدیران فدرال کانادا

از ۲۴,۰۰۰