| Classifications | Code | Applications | Subscribe |
|---|
| |
| LINGUISTICS | 704/001000 | 0 | |
| |
| Translation machine | 704/002000 | 0 | |
| |
| Having particular Input/Output device | 704/003000 | 0 | |
| |
| Based on phrase, clause, or idiom | 704/004000 | 0 | |
| |
| For partial translation | 704/005000 | 0 | |
| |
| Punctuation | 704/006000 | 0 | |
| |
| Storage or retrieval of data | 704/007000 | 1 | |
| |
| Multilingual or national language support | 704/008000 | 0 | |
| |
| Natural language | 704/009000 | 2 | |
| |
| Dictionary building, modification, or prioritization | 704/010000 | 0 | |
| |
| SPEECH SIGNAL PROCESSING | 704/200000 | 0 | |
| |
| Psychoacoustic | 704/200100 | 0 | |
| |
| For storage or transmission | 704/201000 | 0 | |
| |
| Neural network | 704/202000 | 0 | |
| |
| Transformation | 704/203000 | 0 | |
| |
| Orthogonal functions | 704/204000 | 0 | |
| |
| Frequency | 704/205000 | 0 | |
| |
| Specialized information | 704/206000 | 0 | |
| |
| Pitch | 704/207000 | 0 | |
| |
| Voiced or unvoiced | 704/208000 | 0 | |
| |
| Formant | 704/209000 | 0 | |
| |
| Silence decision | 704/210000 | 0 | |
| |
| Time | 704/211000 | 0 | |
| |
| Pulse code modulation (PCM) | 704/212000 | 0 | |
| |
| Zero crossing | 704/213000 | 0 | |
| |
| Voiced or unvoiced | 704/214000 | 0 | |
| |
| Silence decision | 704/215000 | 0 | |
| |
| Correlation function | 704/216000 | 0 | |
| |
| Autocorrelation | 704/217000 | 0 | |
| |
| Cross-correlation | 704/218000 | 0 | |
| |
| Linear prediction | 704/219000 | 0 | |
| |
| Analysis by synthesis | 704/220000 | 0 | |
| |
| Pattern matching vocoders | 704/221000 | 0 | |
| |
| Vector quantization | 704/222000 | 0 | |
| |
| Excitation patterns | 704/223000 | 0 | |
| |
| Normalizing | 704/224000 | 0 | |
| |
| Gain control | 704/225000 | 0 | |
| |
| Noise | 704/226000 | 0 | |
| |
| Pretransmission | 704/227000 | 0 | |
| |
| Post-transmission | 704/228000 | 0 | |
| |
| Adaptive bit allocation | 704/229000 | 0 | |
| |
| Quantization | 704/230000 | 0 | |
| |
| Recognition | 704/231000 | 0 | |
| |
| Neural network | 704/232000 | 0 | |
| |
| Detect speech in noise | 704/233000 | 1 | |
| |
| Normalizing | 704/234000 | 0 | |
| |
| Speech to image | 704/235000 | 3 | |
| |
| Specialized equations or comparisons | 704/236000 | 0 | |
| |
| Correlation | 704/237000 | 0 | |
| |
| Distance | 704/238000 | 0 | |
| |
| Similarity | 704/239000 | 0 | |
| |
| Probability | 704/240000 | 1 | |
| |
| Dynamic time warping | 704/241000 | 0 | |
| |
| Viterbi trellis | 704/242000 | 0 | |
| |
| Creating patterns for matching | 704/243000 | 0 | |
| |
| Update patterns | 704/244000 | 0 | |
| |
| Clustering | 704/245000 | 0 | |
| |
| Voice recognition | 704/246000 | 2 | |
| |
| Preliminary matching | 704/247000 | 0 | |
| |
| Endpoint detection | 704/248000 | 0 | |
| |
| Subportions | 704/249000 | 0 | |
| |
| Specialized models | 704/250000 | 0 | |
| |
| Word recognition | 704/251000 | 0 | |
| |
| Preliminary matching | 704/252000 | 0 | |
| |
| Endpoint detection | 704/253000 | 0 | |
| |
| Subportions | 704/254000 | 0 | |
| |
| Specialized models | 704/255000 | 0 | |
| |
| Markov | 704/256000 | 0 | |
| |
| Hidden Markov Model (HMM) (EPO) | 704/256100 | 0 | |
| |
| Training of HMM (EPO) | 704/256200 | 0 | |
| |
| With insufficient amount of training data, e.g., state sharing, tying, deleted interpolation (EPO) | 704/256300 | 0 | |
| |
| Duration modeling in HMM, e.g., semi HMM, segmental models, transition probabilities (EPO) | 704/256400 | 0 | |
| |
| Hidden Markov (HM) network (EPO) | 704/256500 | 0 | |
| |
| State emission probability (EPO) | 704/256600 | 0 | |
| |
| Continuous density, e.g, Gaussian distribution, Lapalce (EPO) | 704/256700 | 0 | |
| |
| Discrete density, e.g., Vector Quantization preprocessor, look up tables (EPO) | 704/256800 | 0 | |
| |
| Natural language | 704/257000 | 1 | |
| |
| Synthesis | 704/258000 | 0 | |
| |
| Neural network | 704/259000 | 0 | |
| |
| Image to speech | 704/260000 | 0 | |
| |
| Vocal tract model | 704/261000 | 0 | |
| |
| Linear prediction | 704/262000 | 0 | |
| |
| Correlation | 704/263000 | 0 | |
| |
| Excitation | 704/264000 | 0 | |
| |
| Interpolation | 704/265000 | 0 | |
| |
| Specialized model | 704/266000 | 0 | |
| |
| Time element | 704/267000 | 0 | |
| |
| Frequency element | 704/268000 | 0 | |
| |
| Transformation | 704/269000 | 0 | |
| |
| Application | 704/270000 | 0 | |
| |
| Speech assisted network | 704/270100 | 0 | |
| |
| Handicap aid | 704/271000 | 0 | |
| |
| Novelty item | 704/272000 | 0 | |
| |
| Security system | 704/273000 | 0 | |
| |
| Warning/alarm system | 704/274000 | 0 | |
| |
| Speech controlled system | 704/275000 | 0 | |
| |
| Pattern display | 704/276000 | 0 | |
| |
| Translation | 704/277000 | 0 | |
| |
| Sound editing | 704/278000 | 0 | |
| |
| AUDIO SIGNAL BANDWIDTH COMPRESSION OR EXPANSION | 704/500000 | 0 | |
| |
| With content reduction encoding | 704/501000 | 0 | |
| |
| Delay line | 704/502000 | 0 | |
| |
| AUDIO SIGNAL TIME COMPRESSION OR EXPANSION (E.G., RUN LENGTH CODING) | 704/503000 | 0 | |
| |
| With content reduction encoding | 704/504000 | 0 | |
| |
| SPEAKER IDENTIFICATION OR VERIFICATION (EPO) | 704/E17001 | 0 | |
| |
| Recognition of special voice characteristics, e.g., for use in a lie detector; recognition of animal voices, etc. (EPO) | 704/E17002 | 0 | |
| |
| Systems using speaker recognizers (EPO) | 704/E17003 | 0 | |
| |
| Details (EPO) | 704/E17004 | 0 | |
| |
| Preprocessing operations, e.g., segment selection, etc., pattern representation or modeling, e.g., based on linear discriminant analysis (LDA), principal components, etc.; feature selection or extraction (EPO) | 704/E17005 | 0 | |
| |
| Training, model building, enrollment (EPO) | 704/E17006 | 0 | |
| |
| Decision making techniques, pattern matching strategies (EPO) | 704/E17007 | 0 | |
| |
| Use of particular distance or distortion metric between probe pattern and reference templates (EPO) | 704/E17008 | 0 | |
| |
| Multimodal systems, i.e., based on the integration of multiple recognition engines or experts fusion (EPO) | 704/E17009 | 0 | |
| |
| Score normalization (EPO) | 704/E17010 | 0 | |
| |
| Use of phonemic categorization or speech recognition prior to speaker recognition or verification (EPO) | 704/E17011 | 0 | |
| |
| Hidden Markov Models (HMMs) (EPO) | 704/E17012 | 0 | |
| |
| Artificial neural networks, connectionist approaches (EPO) | 704/E17013 | 0 | |
| |
| Pattern transformations and operations aimed at increasing system robustness, e.g., against channel noise, different working conditions, etc. (EPO) | 704/E17014 | 0 | |
| |
| Interactive procedures, man-machine interface (EPO) | 704/E17015 | 0 | |
| |
| User prompted to utter a password or predefined text (EPO) | 704/E17016 | 0 | |
| |
| SPEECH RECOGNITION (EPO) | 704/E15001 | 1 | |
| |
| Assessment or evaluation of speech recognition systems (EPO) | 704/E15002 | 0 | |
| |
| Language recognition (EPO) | 704/E15003 | 0 | |
| |
| Feature extraction for speech recognition; selection of recognition unit (EPO) | 704/E15004 | 1 | |
| |
| Segmentation or word limit detection (EPO) | 704/E15005 | 0 | |
| |
| Word boundary detection (EPO) | 704/E15006 | 0 | |
| |
| Creation of reference templates; training of speech recognition systems, e.g., adaption to the characteristics of the speaker's voice, etc. (EPO) | 704/E15007 | 0 | |
| |
| Training (EPO) | 704/E15008 | 0 | |
| |
| Adaptation (EPO) | 704/E15009 | 0 | |
| |
| In the frequency domain (EPO) | 704/E15010 | 0 | |
| |
| To speaker (EPO) | 704/E15011 | 0 | |
| |
| Supervised, i.e., under machine guidance (EPO) | 704/E15012 | 0 | |
| |
| Unsupervised (EPO) | 704/E15013 | 0 | |
| |
| Speech classification or search (EPO) | 704/E15014 | 0 | |
| |
| Using distance or distortion measures between unknown speech and reference templates (EPO) | 704/E15015 | 0 | |
| |
| Using dynamic programming techniques, e.g., Dynamic Time Warping (DTW), etc. (EPO) | 704/E15016 | 0 | |
| |
| Using artificial neural networks (EPO) | 704/E15017 | 0 | |
| |
| Using natural language modeling (EPO) | 704/E15018 | 0 | |
| |
| Using context dependencies, e.g., language models, etc. (EPO) | 704/E15019 | 0 | |
| |
| Phonemic context, e.g., pronunciation rules, phonotactical constraints, phoneme n-grams, etc. (EPO) | 704/E15020 | 0 | |
| |
| Grammatical context, e.g., disambiguation of the recognition hypotheses based on word sequence rules, etc. (EPO) | 704/E15021 | 0 | |
| |
| Formal grammars, e.g., finite state automata, context free grammars, word networks, etc. (EPO) | 704/E15022 | 0 | |
| |
| Probabilistic grammars, e.g., word n-grams, etc. (EPO) | 704/E15023 | 0 | |
| |
| Semantic context, e.g., disambiguation of the recognition hypotheses based on word meaning, etc. (EPO) | 704/E15024 | 0 | |
| |
| Using prosody or stress (EPO) | 704/E15025 | 0 | |
| |
| Parsing for meaning understanding (EPO) | 704/E15026 | 0 | |
| |
| Using statistical models, e.g., Hidden Markov Models (HMMs), etc. (EPO) | 704/E15027 | 0 | |
| |
| Hidden Markov Models (HMMs) (EPO) | 704/E15028 | 0 | |
| |
| Training of Hidden Markov Models (HMMs) (EPO) | 704/E15029 | 0 | |
| |
| With insufficient amount of training data, e.g., state sharing, tying, deleted interpolation, etc. (EPO) | 704/E15030 | 0 | |
| |
| Duration modeling in Hidden Markov Models (HMMs), e.g., semi-HMM, segmental models, transition probabilities, etc. (EPO) | 704/E15031 | 0 | |
| |
| Hidden Markov Models (HMMs) network (EPO) | 704/E15032 | 0 | |
| |
| State emission probabilities (EPO) | 704/E15033 | 0 | |
| |
| Continuous densities, e.g., Gaussian distribution, Laplace, etc. (EPO) | 704/E15034 | 0 | |
| |
| Discrete densities, e.g., Vector Quantization preprocessor, look-up tables, etc. (EPO) | 704/E15035 | 0 | |
| |
| Neural Network (NN) as output probability estimator, e.g., hybrid HMM/NN, etc. (EPO) | 704/E15036 | 0 | |
| |
| Non-hidden Markov Model (EPO) | 704/E15037 | 0 | |
| |
| Recognition networks (EPO) | 704/E15038 | 0 | |
| |
| Speech recognition techniques for robustness in adverse environments, e.g., in noise, of stress induced speech, etc. (EPO) | 704/E15039 | 0 | |
| |
| Procedures used during a speech recognition process, e.g., man-machine dialogue, etc. (EPO) | 704/E15040 | 0 | |
| |
| Speech recognition using nonacoustical features, e.g., position of the lips, etc. (EPO) | 704/E15041 | 0 | |
| |
| Using position of the lips, movement of the lips, or face analysis (EPO) | 704/E15042 | 0 | |
| |
| Speech to text systems (EPO) | 704/E15043 | 1 | |
| |
| Speech recognition depending on application context, e.g., in a computer, etc. (EPO) | 704/E15044 | 0 | |
| |
| Systems using speech recognizers (EPO) | 704/E15045 | 0 | |
| |
| Constructional details of speech recognition systems (EPO) | 704/E15046 | 0 | |
| |
| Distributed recognition, e.g., in client-server systems for mobile phones or network applications, etc. (EPO) | 704/E15047 | 0 | |
| |
| Memory allocation or algorithm optimization to reduce hardware requirements (EPO) | 704/E15048 | 0 | |
| |
| Multiple recognizers used in sequence or in parallel; corresponding voting or score combination systems (EPO) | 704/E15049 | 0 | |
| |
| Recognizers for parallel processing (EPO) | 704/E15050 | 0 | |
| |
| SPEECH OR AUDIO SIGNAL ANALYSIS-SYNTHESIS TECHNIQUES FOR REDUNDANCY REDUCTION, E.G., IN VOCODERS, ETC.; CODING OR DECODING OF SPEECH OR AUDIO SIGNALS; COMPRESSION OR EXPANSION OF SPEECH OR AUDIO SIGNALS, E.G., SOURCE-FILTER MODELS, PSYCHOACOUSTIC ANALYSIS, ETC. (EPO) | 704/E19001 | 0 | |
| |
| Perceptual measures for quality assessment (EPO) | 704/E19002 | 0 | |
| |
| Correction of errors induced by the transmission channel, if related to the coding (EPO) | 704/E19003 | 0 | |
| |
| Lossless audio signal coding; perfect reconstruction of coded audio signal by transmission of coding error (EPO) | 704/E19004 | 0 | |
| |
| Multichannel audio signal coding and decoding, i.e., using interchannel correlation to reduce redundancies, e.g., joint-stereo, intensity-coding, matrixing, etc. (EPO) | 704/E19005 | 0 | |
| |
| Comfort noise, silence coding (EPO) | 704/E19006 | 0 | |
| |
| Speech coding using phonetic or linguistical decoding of the source; reconstruction using text-to-speech synthesis (EPO) | 704/E19007 | 0 | |
| |
| Systems using vocoders (EPO) | 704/E19008 | 0 | |
| |
| Audio watermarking, i.e., embedding inaudible data in the audio signal (EPO) | 704/E19009 | 0 | |
| |
| Using spectral analysis, e.g., transform vocoders, subband vocoders, perceptual audio coders, psychoacoustically based lossy encoding, etc., e.g., MPEG audio, Dolby AC-3, etc. (EPO) | 704/E19010 | 0 | |
| |
| Blocking, i.e., grouping of samples in time, choice of analysis window, overlap factor (EPO) | 704/E19011 | 0 | |
| |
| Detection of transients and attacks for time/frequency resolution switching (EPO) | 704/E19012 | 0 | |
| |
| Noise substitution, i.e., substituting nontonal spectral components by noisy source (EPO) | 704/E19013 | 0 | |
| |
| Spectral prediction for pre-echo prevention; temporal noise shaping (TNS), e.g., in MPEG2 or MPEG4, etc. (EPO) | 704/E19014 | 0 | |
| |
| Quantization or dequantization of spectral components (EPO) | 704/E19015 | 0 | |
| |
| Scalar quantization (EPO) | 704/E19016 | 0 | |
| |
| Vector quantization, e.g., Twin-VQ audio, etc. (EPO) | 704/E19017 | 0 | |
| |
| Using subband decomposition (EPO) | 704/E19018 | 0 | |
| |
| Subband vocoders (EPO) | 704/E19019 | 0 | |
| |
| Using orthogonal transformation (EPO) | 704/E19020 | 0 | |
| |
| Using wavelet decomposition (EPO) | 704/E19021 | 0 | |
| |
| Dynamic bit allocation (EPO) | 704/E19022 | 0 | |
| |
| Using predictive techniques; codecs based on source-filter modelization (EPO) | 704/E19023 | 0 | |
| |
| Determination or coding of the spectral characteristics, e.g., of the short-term prediction coefficients, etc. (EPO) | 704/E19024 | 0 | |
| |
| Line spectrum pair (LSP) vocoders (EPO) | 704/E19025 | 0 | |
| |
| Determination or coding of the excitation function; determination or coding of the long-term prediction characteristics (EPO) | 704/E19026 | 0 | |
| |
| Determination or coding of an excitation gain (EPO) | 704/E19027 | 0 | |
| |
| Using mixed excitation model, e.g., MELP, MBE, Split band LPC, HVXC, etc. (EPO) | 704/E19028 | 0 | |
| |
| Long-term prediction, i.e., removing periodical redundancies, e.g., adaptive codebook, pitch predictor, etc. (EPO) | 704/E19029 | 0 | |
| |
| Using sinusoidal excitation model (EPO) | 704/E19030 | 0 | |
| |
| Using prototype waveform decomposition or waveform interpolative coders (PWI) (EPO) | 704/E19031 | 0 | |
| |
| Determination or coding of a multipulse excitation (EPO) | 704/E19032 | 0 | |
| |
| Algebraic codebook; sparse pulse excitation (EPO) | 704/E19033 | 0 | |
| |
| Regular pulse excitation (EPO) | 704/E19034 | 0 | |
| |
| Determination or coding of a code excitation; code excited linear prediction (CELP) vocoders (EPO) | 704/E19035 | 0 | |
| |
| Pitch excitation, e.g., PSI-CELP (pitch synchronous innovation CELP), etc. (EPO) | 704/E19036 | 0 | |
| |
| Residual excited linear prediction (RELP) (EPO) | 704/E19037 | 0 | |
| |
| Vector sum excited linear prediction (VSELP) (EPO) | 704/E19038 | 0 | |
| |
| Details of speech and audio coders (EPO) | 704/E19039 | 0 | |
| |
| Vocoder architecture (EPO) | 704/E19040 | 0 | |
| |
| Vocoders using multiple modes (EPO) | 704/E19041 | 0 | |
| |
| Using sound class specific coding, hybrid encoders, object-based coding (EPO) | 704/E19042 | 0 | |
| |
| Mode decision, i.e., based on audio signal content versus external parameter (EPO) | 704/E19043 | 0 | |
| |
| Variable rate or variable quality codecs, e.g., scalable representation encoding, etc. (EPO) | 704/E19044 | 0 | |
| |
| Pre- or post-filtering (EPO) | 704/E19045 | 0 | |
| |
| Pre-filtering, e.g., high frequency emphasis prior to encoding, etc. (EPO) | 704/E19046 | 0 | |
| |
| Post-filtering, e.g., pitch enhancement, formant emphasis for decoder, etc. (EPO) | 704/E19047 | 0 | |
| |
| Audio streaming, i.e., formatting and decoding of an encoded audio signal (EPO) | 704/E19048 | 0 | |
| |
| Transcoding, i.e., converting between two coded representations avoiding cascaded coding-decoding (EPO) | 704/E19049 | 0 | |
| |
| MODIFICATION OF AT LEAST ONE CHARACTERISTIC OF SPEECH WAVES (EPO) | 704/E21001 | 0 | |
| |
| Speech enhancement, e.g., noise reduction, echo cancellation, etc. (EPO) | 704/E21002 | 0 | |
| |
| Applications (EPO) | 704/E21003 | 0 | |
| |
| Speech corrupted by noise (EPO) | 704/E21004 | 0 | |
| |
| Periodic noise (EPO) | 704/E21005 | 0 | |
| |
| The noise being separate speech (EPO) | 704/E21006 | 0 | |
| |
| Speech corrupted by echo-reverberation (EPO) | 704/E21007 | 0 | |
| |
| Speech corrupted by stress-Lombard effect (EPO) | 704/E21008 | 0 | |
| |
| Enhancement of intelligibility of clean or coded speech (EPO) | 704/E21009 | 0 | |
| |
| Enhancement of diverse speech (EPO) | 704/E21010 | 0 | |
| |
| Bandwidth extension taking place at the receiving side, e.g., generation of low- or high-frequency components, regeneration of spectral holes, etc. (EPO) | 704/E21011 | 0 | |
| |
| Separate reconstruction of interference and of speech signal (EPO) | 704/E21012 | 0 | |
| |
| The interference being a separate speaker (EPO) | 704/E21013 | 0 | |
| |
| Active noise canceling (EPO) | 704/E21014 | 0 | |
| |
| Public address system (EPO) | 704/E21015 | 0 | |
| |
| Suppression or repetition of time signal segments (EPO) | 704/E21016 | 0 | |
| |
| Time compression or expansion (EPO) | 704/E21017 | 0 | |
| |
| Suppression or repetition of time signal segments (EPO) | 704/E21018 | 0 | |
| |
| Transformation of speech into a nonaudible representation, e.g., speech visualization, speech processing for tactile aids, etc. (EPO) | 704/E21019 | 0 | |
| |
| Synchronization of speech with image or synthesis of the lips movement from speech, e.g., for "talking heads," etc.(EPO) | 704/E21020 | 0 | |
| |
| MISCELLANEOUS ANALYSIS OR DETECTION OF SPEECH CHARACTERISTICS (EPO) | 704/E11001 | 0 | |
| |
| General speech analysis without concrete application (EPO) | 704/E11002 | 0 | |
| |
| Detection of presence or absence of speech signals (EPO) | 704/E11003 | 0 | |
| |
| Voice/data decision (EPO) | 704/E11004 | 0 | |
| |
| End point detection (EPO) | 704/E11005 | 0 | |
| |
| Pitch determination of speech signals (EPO) | 704/E11006 | 0 | |
| |
| Voiced-unvoiced decision (EPO) | 704/E11007 | 0 | |
| |
| SPEECH SYNTHESIS; TEXT TO SPEECH SYSTEMS (EPO) | 704/E13001 | 0 | |
| |
| Methods for producing synthetic speech; speech synthesizers (EPO) | 704/E13002 | 0 | |
| |
| Concept-to-speech synthesizers; generation of natural phrases not from text but from machine-based concepts (EPO) | 704/E13003 | 0 | |
| |
| Sound editing, manipulating voice of the synthesizer (EPO) | 704/E13004 | 0 | |
| |
| Details of speech synthesis systems, e.g., synthesizer architecture, memory management, etc. (EPO) | 704/E13005 | 0 | |
| |
| Architecture of speech synthesizers (EPO) | 704/E13006 | 0 | |
| |
| Excitation (EPO) | 704/E13007 | 0 | |
| |
| Systems using speech synthesizers (EPO) | 704/E13008 | 0 | |
| |
| Elementary speech units used in speech synthesizers; concatenation rules (EPO) | 704/E13009 | 0 | |
| |
| Concatenation (EPO) | 704/E13010 | 0 | |
| |
| Text analysis, generation of parameters for speech synthesis out of text, e.g., grapheme to phoneme translation, prosody generation, stress, or intonation determination, etc. (EPO) | 704/E13011 | 0 | |
| |
| Grapheme to phoneme, detection of language (EPO) | 704/E13012 | 0 | |
| |
| Prosody rules derived from text (EPO) | 704/E13013 | 0 | |
| |
| Stress or intonation (EPO) | 704/E13014 | 0 | |
| |
| CLASS-RELATED FOREIGN DOCUMENTS | 704/FOR000 | 0 | |
| |