TensorFlow implementation of Wav2Vec2

This repository presents an implementation of the Wav2Vec2 model [1] in TensorFlow 2.0 as a part of Google Summer of Code. For a quick demo, please check out this. Final report of the project can be found here. Notebooks The repository comes with shiny Colab Notebooks. Below you can find a list of them. Spin them up and don’t forget to have fun! Checkpoints Below is a summary of checkpoints obtained during the project: To know more about the process […]

Read more

A Python package for easy multiprocessing

MPIRE, short for MultiProcessing Is Really Easy, is a Python package for multiprocessing, but faster and more user-friendly than the default multiprocessing package. It combines the convenient map like functions of multiprocessing.Pool with the benefits of using copy-on-write shared objects of multiprocessing.Process, together with easy-to-use worker state, worker insights, and progress bar functionality. Full documentation is available at https://slimmer-ai.github.io/mpire/. Features Faster execution than other multiprocessing libraries. See benchmarks. Intuitive, Pythonic syntax Multiprocessing with map/map_unordered/imap/imap_unordered functions Easy use of copy-on-write shared […]

Read more

Robust Video Matting in PyTorch and TensorFlow

Official repository for the paper Robust High-Resolution Video Matting with Temporal Guidance. RVM is specifically designed for robust human video matting. Unlike existing neural models that process frames as independent images, RVM uses a recurrent neural network to process videos with temporal memory. RVM can perform matting in real-time on any videos without additional inputs. It achieves 4K 76FPS and HD 104FPS on an Nvidia GTX 1080 Ti GPU. The project was developed at ByteDance Inc. News [Aug 25 2021] […]

Read more

A python command line toolfor archiving Twitter JSON

Collect data at the command line from the Twitter API (v1.1 and v2). Contributing Documentation The documentation is managed at ReadTheDocs. If you would like to improve the documentation you can edit the Markdown files in docs or add new ones. Then send a pull request and we can add it. To view your documentation locally you should be able to: pip install -r requirements-mkdocs.txt mkdocs serve open http://127.0.0.1:8000/ If you prefer you can create a page on the wiki […]

Read more

Make your AirPlay devices as TTS speakers

Home Assistant integration component, make your AirPlay devices as TTS speakers. 2021.6.X or earlier Apple Airplayer component requires pyatv 0.8.1, which is self-contained in the latest version Home Assistant (2021.7.3). You can run pip list | grep pyqatv in your Home Assistant container host to check the version of pyatv. If lower than 0.8.1, you should run commands as below to upgrade pyatv. apk update apk add build-base pip3 install –upgrade pyatv pip3 install –upgrade attrs 2021.7.X or later There […]

Read more

Bringing Ethereum Virtual Machine to StarkNet at warp speed

Warp brings EVM compatible languages to StarkNet, making it possible to transpile Ethereum smart contracts to Cairo, and use them on StarkNet. Installation⚙️ Linux: sudo apt update sudo apt install software-properties-common sudo add-apt-repository ppa:deadsnakes/ppa sudo apt update sudo apt install -y python3.7 sudo apt install -y python3.7-dev sudo apt install -y libgmp3-dev python3.7 -m venv ~/warp_demo source ~/warp_demo/bin/activate pip install wheel pip install ecdsa fastecdsa sympy pip install cairo-lang==0.3.1 make warp    

Read more

A Diverse and Non-parallel Framework for Natural-Sounding Voice Conversion

Yinghao Aaron Li, Ali Zare, Nima Mesgarani We present an unsupervised non-parallel many-to-many voice conversion (VC) method using a generative adversarial network (GAN) called StarGAN v2. Using a combination of adversarial source classifier loss and perceptual loss, our model significantly outperforms previous VC models. Although our model is trained only with 20 English speakers, it generalizes to a variety of voice conversion tasks, such as any-to-many, cross-lingual, and singing conversion. Using a style encoder, our framework can also convert plain […]

Read more

An open source Jetson Nano baseboard and tools to design your own

This basic baseboard gives the user the foundation and the flexibility to design their own baseboard for the Jetson Nano. It also repurposes some of the Jetson Nano’s interface signals for simple beginner projects. This baseboard, as designed, contains: A 5V, 4A DC barrel jack 4 USB 2.0 connectors An HDMI connector A UART-to-USB bridge A debug USB A 40-pin GPIO A servo header Three user-interactive buttons (power, reset, and force recovery) A flex connector for an OLED display Use […]

Read more
1 2 3 51