Part 14: Step by Step Guide to Master NLP – Basics of Topic Modelling

This article was published as a part of the Data Science Blogathon Introduction This article is part of an ongoing blog series on Natural Language Processing (NLP). In this series, we completed our discussion on the entity extraction technique “Named Entity Recognition (NER)”. But at that time, we didn’t discuss another popular entity extraction technique called Topic Modelling. So, in continuation of that article, we will discuss Topic modelling in this article. In this article, we will discuss firstly some of […]

Read more

Part 17: Step by Step Guide to Master NLP – Topic Modelling using pLSA

This article was published as a part of the Data Science Blogathon Introduction This article is part of an ongoing blog series on Natural Language Processing (NLP). In the previous article, we discussed a Topic modelling technique named Latent Semantic Analysis (LSA), but we observed that there are some disadvantages of LSA, so to overcome those problems, we come up with the concept of pLSA, which stands for Probabilistic Latent Semantic Analysis. So, In this article, we will deep dive into […]

Read more

Topic extraction From Prime Minister Modi’s Speech

This article was published as a part of the Data Science Blogathon INTRODUCTION Artificial Intelligence (AI) has been a trendy term among individuals for many years. Earlier, when we used to hear the term “AI”, we could only think about Robots. However AI is not limited to robots, and nowadays, every electronic device we use has AI associated with it, be it smartphones, smart TVs, refrigerators, or Air conditioners. AI basically means a machine can take its decision without human intervention. […]

Read more

An interactive and multi-function Telegram bot

PyKorone An interaction and fun bot for Telegram groups, having some useful and other useless commands. Created as an experiment and learning bot but being expanded and improved over time. This repository contains the source code of @PyKoroneBot Telegram bot, along with instructions for hosting your own instance. Requirements Installation Setup First, clone this Git repository locally: git clonehttps://github.com/AmanoTeam/PyKorone After that, you can run python3 -m pip install . to install the bot along with the dependencies. Error: Directory ‘.’ […]

Read more

Semi Supervised Raw-to-Raw Mapping in python

raw2raw Project page for the paper Semi-Supervised Raw-to-Raw Mapping 2021. Mahmoud Afifi and Abdullah Abuolaim York University Project page of the paper Semi-Supervised Raw-to-Raw Mapping. Mahmoud Afifi and Abdullah Abuolaim. arXiv preprint arXiv:2106.13883, 2021. If you use this code, please cite our paper: @article{afifi2021raw2raw, title={Semi-Supervised Raw-to-Raw Mapping}, author={Afifi, Mahmoud and Abuolaim, Abdullah}, journal={arXiv preprint arXiv:2106.13883}, year={2021} } Abstract The raw-RGB colors of a camera sensor vary due to the spectral sensitivity differences across different sensor makes and models. This paper […]

Read more

Used to format docstrings in Python files or reStructuredText

Style-Doc Style-Doc is Black for Python docstrings and reStructuredText (rst). It can be used to format docstrings (Google docstring format) in Python files or reStructuredText. Installation Style-Doc is available at the Python Package Index (PyPI). It can be installed with pip: $ pip install style-doc Usage $ style-doc –help usage: style-doc [-h] [–max_len MAX_LEN] [–check_only] [–py_only] [–rst_only] files [files …] positional arguments: files The file(s) or folder(s) to restyle. optional arguments: -h, –help show this help message and exit –max_len […]

Read more

A flask server that fixes twitter embeds in discord

TwitFix very basic flask server that fixes twitter embeds in discord by using youtube-dl to grab the direct link to the MP4 file and embeds the link to it in a custom page, with link caching This does work! but I’m new to flask, so it can probably be improved a great deal. How to use (discord side) just put the url to the server, and directly after, the full URL to the tweet you want to embed I now […]

Read more

A google search engine dorker with HTTP toolkit built with python

domhttpx domhttpx is a google search engine dorker with HTTP toolkit built with python, can make it easier for you to find many URLs/IPs at once with fast time. Install > git clone https://github.com/naufalardhani/domhttpx.git > cd domhttpx > pip3 install -r requirements.txt > python3 domhttpx.py –help Flags This will display help for the tool. Here are all the switches it supports. Flag Description Example -ip, –only-ip Show output as IP only domhttpx –only-ip -od, –only-domain Show output as domain only […]

Read more

A python parser to parse and investigate Digital Terrain Elevation Data files

DTED Parser This is a package written in pure python (with help from numpy) to parse and investigate Digital Terrain Elevation Data (DTED) files. This package is tested to work on Shuttle Radar Topography Mission (SRTM) DTED files (as far as I can tell these are the only publicly available DTED files). This can be used as a library to parse these files into numpy arrays and additionally exposes a CLI that can be used to investigate individual DTED files. […]

Read more

Add Chart.js visualizations to Django admin using a mixin class in python

django-admincharts Add Chart.js visualizations to your Django admin using a mixin class. Example from django.contrib import admin from .models import BillingAccount from admincharts.admin import AdminChartMixin from admincharts.utils import months_between_dates @admin.register(BillingAccount) class BillingAccountAdmin(AdminChartMixin, admin.ModelAdmin): def get_list_chart_data(self, queryset): if not queryset: return {} # Cannot reorder the queryset at this point earliest = min([x.ctime for x in queryset]) labels = [] totals = [] for b in months_between_dates(earliest, timezone.now()): labels.append(b.strftime(“%b %Y”)) totals.append( len( [ x for x in queryset if x.ctime.year == […]

Read more
1 520 521 522 523 524 875