Month: January 2023

Microsoft’s VALL-E can synthesize your voice from 3 sec of audio

January 28, 2023 / admin / 0 Comments

Microsoft has introduced a new language modeling approach for text-to-speech synthesis (TTS) called VALL-E. The approach uses discrete codes derived from an off-the-shelf neural audio codec model, and is trained using 60K hours of English speech, which is hundreds of times larger than existing systems, and can be used to synthesize high-quality personalized speech with only a 3-second enrolled recording of an unseen speaker as an acoustic prompt (project page, paper).

An unofficial Pytorch implementation for VALL-E is available on GitHub.

Microsoft presents BioGPT, a domain-specific generative model that achieved human parity

January 27, 2023 / admin / 0 Comments

Microsoft Research presented with BioGPT a domain-specific generative model that was trained on biomedical literature and achieved best-in-class performance above the human level [paper, article].

Meta presents MAV3D, a text to 3D-video model

January 27, 2023 / admin / 0 Comments

Meta presented with MAV3D (Make-A-Video-3D) a method for generating 4D content, i.e. a 3D video, from a text description by using a 4D dynamic Neural Radiance Field (NeRF) [project page, paper]. Unfortunately, the source code has not been released.

Google publishes MusicLM, a text to music language model

January 27, 2023 / admin / 0 Comments

Google Research published an impressive language model that can turn a text description into high-quality music [webpage, paper]. The source code is unfortunately not publicly available.

Google’s Med-PaLM comes close to human performance in clinical knowledge

January 12, 2023 / admin / 0 Comments

In a recent paper from Dec 26, 2022, Google demonstrates that its large language model Med-PaLM, based on 540 billion parameters with a special instruction prompt tuning for the medical domain, reaches almost clinician’s performance on new medical benchmarks MultiMedQA (benchmark combining six existing open question answering datasets spanning professional medical exams, research, and consumer queries) and HealthSearchQA (a new free-response dataset of medical questions searched online). The evaluation of the answers considering factuality, precision, possible harm, and bias was done by human experts.

GPT-3.5 passes parts of the US legal Bar Exam

January 2, 2023 / admin / 0 Comments

In the United States, most jurisdictions require applicants to pass the Bar Exam in order to practice law. This exam typically requires several years of education and preparation (seven years of post-secondary education, including three years at an accredited law school).

In a publication from Dec 29, 2022, the authors evaluated the performance of GPT-3.5 on the multiple choice part of the exam. While GPT is not yet passing that part of the exam, it significantly exceeded the baseline random chance rate
of 25% and reached the average human passing rate for the categories Evidence and Torts.
On average, GPT is performing about 17% worse than human test-takers across all categories.

Similar to this publication is the report that ChatGPT was able to pass the Wharton Master of Business Applications (MBA) exam.

On March 15, 2023, a paper was published that stated that GPT-4 significantly outperforms both human test-takers and prior models, demonstrating a 26% increase over GPT-3.5 and beating humans in five of seven subject areas.

Month: January 2023

Microsoft’s VALL-E can synthesize your voice from 3 sec of audio

Microsoft presents BioGPT, a domain-specific generative model that achieved human parity

Meta presents MAV3D, a text to 3D-video model

Google publishes MusicLM, a text to music language model

Google’s Med-PaLM comes close to human performance in clinical knowledge

GPT-3.5 passes parts of the US legal Bar Exam

Recent Posts

Recent Comments

Archives

Categories