Tag: NLP

BERT vs GPT: Comparing the NLP Giants

In 2018, NLP researchers were all amazed by the BERT paper [1]. The approach was simple, yet the result was impressive: it set new benchmarks for 11 NLP tasks. In a little over a year, BERT has become a ubiquitous baseline in Natural Language Processing (NLP) experiments counting over 150 resea...

Weekly AI and NLP News — August 28th 2023

News From The Web: GPT-3.5 Turbo fine-tuning released. OpenAI has released fine-tuning for GPT-3.5 Turbo, offering improved performance on specific tasks. The fine-tuned version can even match or surpass the capabilities of base GPT-4. Early testers have significantly reduced prompt size ...

Natural Language Processing For Absolute Beginners

It is mostly true that NLP (Natural Language Processing) is a complex area of computer science. Frameworks like SpaCy or NLTK are large and often require some learning. But with the help of open-source large language models (LLMs) and modern Python libraries, many tasks can be solved much more easil...

Measurement of Social Bias Fairness Metrics in NLP Models

In recent times, text-generation-based models have become more popular than ever. With the introduction of ChatGPT and similar models, people have begun using NLP models daily. However, the use cases for NLP models are not limited to text generation; they include sentiment analysis, key...

Weekly AI and NLP News — September 5th 2023

Here are your weekly articles, guides, and news about NLP and AI chosen for you by NLPlanet! News From The Web: Introducing ChatGPT Enterprise. OpenAI has released ChatGPT Enterprise, which provides enterprise-level security and privacy features. These include unlimited access to GPT-4,...

Large Language Models, Part 1: BERT

2017 was a historic year in machine learning, when the Transformer model made its first appearance on the scene. It has been performing amazingly on many benchmarks and has become suitable for lots of problems in Data Science. Thanks to its efficient architecture, many other Transformer-ba...

How to Chunk Text Data — A Comparative Analysis

The ‘Text chunking’ process in Natural Language Processing (NLP) involves the conversion of unstructured text data into meaningful units. This seemingly simple task belies the complexity of the various methods employed to achieve it, each with its strengths and weaknesses. At a high l...

T5: Text-to-Text Transformers (Part One)

The transfer learning paradigm is comprised of two main stages. First, we pre-train a deep neural network over a bunch of data. Then, we fine-tune this model (i.e., train it some more) over a more specific, downstream dataset. The exact implementation of these stages may take many different forms. I...

Mastering Imbalanced NLP Datasets

Natural Language Processing (NLP) has found applications in various domains, including sentiment analysis, chatbots, and content moderation. One common challenge in NLP projects is dealing with imbalanced datasets, where one class of data significantly outnumbers the other. In this blog, we’ll...

Improve your AI text-to-image prompts with enhanced NLP

Text-to-image prompts are often seen as very clinical and detached. It’s one reason why critics refuse to see it as art (an opinion I disagree with) — as if AI art is just ordered Subway-style from a string of random descriptors: hyper-realistic, cinematic, detailed, hypermaxim...

Low-latency, real time Legal AI with Spark NLP on gRPC

Some business areas have made more progress than others: compare Financial Services, where AI is a reality, with the Legal sector, where it is just beginning. However, data sources such as SEC’s Edgar (public), Lawinsider (proprietary) and many other initiatives have...

Legal NLP releases Subpoenas section classification model and more LLM examples and use cases

The latest version of Legal NLP, 1.15, introduces numerous additional features to the existing collection of 926+ models and 125+ Language Models from previous releases of the library. Let’s examine each of these new capabilities in detail. New Subpoenas section classifier This mode...

Monitoring delivery reviews with NLP in logistics

Artificial Intelligence (AI) is no longer just a buzzword but a pivotal driver of innovation, and its impact is being felt across the board. The widespread popularity of ChatGPT vividly demonstrates how Natural Language Processing (NLP), Machine Learning, and Deep Learning have shifted from futuristic ...

The Power of NLP in Preserving and Understanding Our Past

Natural language processing (NLP) has emerged as a transformative force in various fields, revolutionizing communication, understanding, and decision-making. Its impact extends beyond the digital world, holding immense potential for preserving and interpreting cultural artifacts that hold the essenc...

A Brief History of NLP leading to LLMs — Part 1

“If I have seen further than others, it is by standing upon the shoulders of giants” — Sir Isaac Newton LLMs were not invented in a vacuum and they certainly were not built overnight. Instead, it took decades of research especially in the areas of deep learning and specific...

Machine learning advancements in Arabic NLP

Natural language processing (NLP) is not a new discipline; its roots date back to the 1600s when philosophers such as Descartes and Leibniz proposed theoretical codes for language. In the past decade, the results of this long history have led to the integration of NLP into our own homes, in the form...

Machine Learning and NLP For Arabic: Part Of Speech Tagging

This project by Omdena is about building open-source NLP libraries and tools for the Arabic language. Arabic is the 5th most spoken language in the world, and it poses several challenges, such as complicated grammar and multiple dialects. So t...

Tokenization Using Indic NLP Library

Hello! I should say नमस्ते, since today’s topic concerns Indian languages. Natural Language Processing looks fascinating, but it’s similar to Machine Learning in that we need data cleaning and data pre-processing. Sounds boring, right? But it’s not our mi...

BERT: Why it’s been revolutionizing NLP

BERT, which stands for Bidirectional Encoder Representations from Transformers, is a language model published in 2018 that achieved state-of-the-art performance on multiple tasks, including question-answering and language understanding. It not only beat previous state-of-the-art computational models...