Using VideoBERT to tackle video prediction

VideoBERT: This repo reproduces the results of VideoBERT (https://arxiv.org/pdf/1904.01766.pdf). Inspiration was taken from https://github.com/MDSKUL/MasterProject, but this repo tackles video prediction rather than captioning and masked language modeling. Because the model is extremely small, the results shown here are basic. Feel free to increase the model size as your computational resources allow, and to change the inference file to include temperature if necessary (temperature is not implemented yet). Here are all the steps […]
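Since the excerpt notes that temperature is not yet part of inference, here is a minimal sketch of what temperature-scaled sampling usually looks like. It is not taken from the repo: the function name `sample_with_temperature` and its arguments are hypothetical, and it works on a plain list of scores rather than on the model's actual output tensors.

```python
import math
import random


def sample_with_temperature(logits, temperature=1.0, rng=None):
    """Sample an index from unnormalized scores after temperature scaling.

    Hypothetical helper for illustration; not part of the VideoBERT repo.
    Returns the sampled index and the probabilities that were used.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    rng = rng or random.Random()

    # Dividing by the temperature sharpens (T < 1) or flattens (T > 1)
    # the resulting distribution.
    scaled = [x / temperature for x in logits]

    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(scaled)
    weights = [math.exp(s - peak) for s in scaled]
    total = sum(weights)
    probs = [w / total for w in weights]

    # Draw one index according to the softmax probabilities.
    index = rng.choices(range(len(probs)), weights=probs, k=1)[0]
    return index, probs
```

A low temperature concentrates probability on the highest-scoring token, while a high temperature makes the choice closer to uniform; in a real inference script the same scaling would be applied to the model's logits before sampling.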

Multi-Task Vision and Language Representation Learning

12-in-1: Multi-Task Vision and Language Representation Learning. Please cite the following if you use this code. Code and pre-trained models for 12-in-1: Multi-Task Vision and Language Representation Learning: @InProceedings{Lu_2020_CVPR, author = {Lu, Jiasen and Goswami, Vedanuj and Rohrbach, Marcus and Parikh, Devi and Lee, Stefan}, title = {12-in-1: Multi-Task Vision and Language Representation Learning}, booktitle = {The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2020} } and ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for […]

Vision-Language Pre-training on Fashion Domain

Kaleido-BERT: Vision-Language Pre-training on Fashion Domain. Mingchen Zhuge*, Dehong Gao*, Deng-Ping Fan#, Linbo Jin, Ben Chen, Haoming Zhou, Minghui Qiu, Ling Shao. Introduction: We present a new vision-language (VL) pre-training model dubbed Kaleido-BERT, which introduces a novel kaleido strategy for fashion cross-modality representations from transformers. In contrast to the random masking strategy of recent VL models, we design alignment guided masking to jointly focus more on image-text semantic relations. To this end, we carry out five novel tasks, i.e., rotation, jigsaw, camouflage, […]

A Benchmark for Interpreting Grounded Instructions for Everyday Tasks

ALFRED (Action Learning From Realistic Environments and Directives) is a new benchmark for learning a mapping from natural language instructions and egocentric vision to sequences of actions for household tasks. Long composition rollouts with non-reversible state changes are among the phenomena we include to shrink the gap between research benchmarks and real-world applications. What's more? Check out ALFWorld: interactive TextWorld environments for ALFRED scenes! Quickstart. Clone repo:
$ git clone https://github.com/askforalfred/alfred.git alfred
$ export ALFRED_ROOT=$(pwd)/alfred
Install requirements:
$ virtualenv […]

DeLighT: Very Deep and Light-weight Transformers

This repository contains the source code of our work on building efficient sequence models: DeFINE (ICLR’20) and DeLighT (preprint). Overview: In this repository, we share the source code of our paper DeLighT, which delivers similar or better performance than transformer-based models with significantly fewer parameters. DeLighT allocates parameters more efficiently both (1) within each Transformer block using DExTra, a deep and light-weight transformation, and (2) across blocks using block-wise scaling, which allows for shallower and narrower DeLighT […]

A powerful and flexible machine learning platform for drug discovery

TorchDrug is a PyTorch-based machine learning toolbox designed for several purposes: easy implementation of graph operations in a PyTorchic style with GPU support; being friendly to practitioners with minimal knowledge about drug discovery; and rapid prototyping of machine learning research. Installation: TorchDrug is compatible with Python >= 3.5 and PyTorch >= 1.4.0. From Conda:
conda install -c milagraph -c conda-forge torchdrug
From Source: TorchDrug depends on rdkit, which is only available via conda. You can install rdkit with the following line. […]

A lightweight wrapper for the IG Markets API written in Python

trading_ig: A lightweight wrapper for the IG Markets API written in Python. Simplifies access to the IG REST and Streaming APIs with a live or demo account. What is it? IG Markets provides financial spread betting and CFD platforms for trading equities, forex, commodities, indices, cryptocurrencies, bonds, rates, options and more. IG provides APIs so that developers can access its platforms programmatically. Using the APIs you can get live and historical data, automate your trades, or create apps. For details […]

Python implementation of the multistate Bennett acceptance ratio

pymbar: Python implementation of the multistate Bennett acceptance ratio (MBAR) method for estimating expectations and free energy differences from equilibrium samples from multiple probability densities. Installation: The easiest way to install the pymbar release is via conda:
conda install -c conda-forge pymbar
You can also install pymbar from the Python package index using pip:
pip install pymbar
The development version can be installed directly from GitHub via pip:
pip install git+https://github.com/choderalab/pymbar.git
Usage: Basic usage involves importing pymbar and constructing an […]

Transform-Invariant Non-Negative Matrix Factorization

A comprehensive Python package for Non-Negative Matrix Factorization (NMF) with a focus on learning transform-invariant representations. The package supports multiple optimization backends and can be easily extended to handle application-specific types of transforms. A general introduction to Non-Negative Matrix Factorization and the purpose of this package can be found on the corresponding GitHub Pages. To use this package, you will need Python version 3.7 (or higher). The package is available via PyPI. Installation is easiest using pip: […]

Library to interact with the lbrynet client from the LBRY project

A library of functions that can be used to manage the download of claims from the LBRY network. It includes methods to download claims by URI (canonical URL), claim ID, or from specific channels. It also includes methods to clean up older files and free space, so the functions are suitable for use on a headless computer that downloads files and seeds them to the network with little user intervention. This library is released as free software under the […]
