Questions tagged [named-entity-recognition]
21 questions
4
votes
1 answer
How to understand 'losses' in Spacy's custom NER training engine?
From the tid-bits, I understand of neural networks (NN), the Loss function is the difference between predicted output and expected output of the NN. I am following this tutorial, the losses are included at line #81 in the nlp.update() function.
I am…

The White Cloud
- 157
- 1
- 7
2
votes
1 answer
Does BERT freeze the entire model body when it does fine-tuning?
Recently, I came across the BERT model. I did some research and tried some implementations.
I wanted to tackle a NER task, so I chose the BertForSequenceClassifications provided by HuggingFace.
for epoch in range(1, args.epochs + 1):
total_loss…

Joon
- 51
- 1
- 6
2
votes
1 answer
Which AI techniques are there that combine multiple models to make sense of data at different stages?
I have been working to design a system that uses multiple machine learning models to make sense of data that is dynamically webscraped. Each AI would handle a specific task, for example:
An AI model would identify text in an image, then attempt to…

Bipbupbop
- 23
- 2
2
votes
0 answers
Extracting "hidden" costs from financial statements using NLP
I'm designing a NLP model to extract various kinds of "hidden" expenses from 10-K and 10-Q financial statements. I've come up with about 7 different expense categories (restructuring costs, merger and acquisitions, etc.) and for each one I have a…

hohner
- 21
- 2
2
votes
1 answer
Is it possible to create a named entity recognition system without using POS tagging in the corpus?
Is it possible to create a named entity recognition system without using POS tagging in the corpus?
2
votes
2 answers
What are the main ideas behind NER?
Named entity recognition (NER), also known as entity chunking/extraction, is a popular technique used in information extraction to identify and segment the named entities and classify or categorize them under various predefined classes.
Briefly,…

Pluviophile
- 1,223
- 5
- 17
- 37
1
vote
1 answer
How to extract product codes within unstructured data
I am working on a project where I have a dataset consisting of unstructured data from multiple ERP systems. Each dataset (extracted from an ERP) has different columns, and unfortunately, there is no standard format for the data. Among the columns,…

delucca
- 11
- 4
1
vote
0 answers
How can I use this Reformer to extract entities from a new sentence?
I have been looking at the NER example with Trax in this notebook. However, the notebook only gives an example for training the model. I can't find any examples of how to use this model to extract entities from a new string of text.
I've tried the…

Alan Buxton
- 121
- 5
1
vote
0 answers
Determining if an entity in free text is 'present' or 'absent'; what is this called in NLP?
I'm processing a semi-structured scientific document and trying to extract some specific concepts. I've actually made quite good progress without machine-learning so far, but I got to a block of true free text and I'm wondering whether a very narrow…

vector07
- 111
- 2
1
vote
0 answers
Are there any good resources (preferably books) about techniques used for entity extraction?
Given some natural language sentences like
I would like to talk to Mr. Smith
I would like to extract entities, like the person "Smith".
I know that frameworks, which are capable of doing so (f. e. RASA or spaCy), exist, but I would like to dive…

BudBrot
- 61
- 4
0
votes
1 answer
How to label missing/default values for a named entity recognition dataset
I am building the training dataset for a named entity recognition model, with 2 tags: Name and Category and I am using a pre-trained spaCy model.
Given a document, the model needs to extract the name and category of several items.
However, the name…

mrang
- 3
- 2
0
votes
0 answers
How to create dataset to extract information and classify intent using BERT?
Given a message: "Hey I am XYZ person (description about oneself), and I was thinking to launch a youtube video, wanted to get in touch with someone with similar experience", the model should detect two things: 1) description about the sender, 2)…
0
votes
0 answers
Convert unstructured text to XML
I am working with a large dataset (example bellow) of unstructured and structured documents that contain a variety of text elements, including paragraphs, tables, and lists. Currently, the process of generating the equivalent XML markup for each…

Nasreddine
- 101
0
votes
1 answer
Is there a way to find a list of associated words to a named entity?
Is there a way to find a list of associated words to a named entity?
For instance, let the named entity be FIFA. Now FIFA is a Football Organization and hence related to the term football and all the terms related to football like jersey,…

Vivek Hotti
- 1
- 1
0
votes
0 answers
Are there techniques to somehow "rank" these URIs based on other properties?
I am using Stanford's Stanza pipeline to perform Named Entity Recognition on news articles.
For every NE span of type PERson I am attempting to link corresponding DBPedia entities (Named Entity Linking) but of course I can get more than one…

Robert Alexander
- 101
- 1