Questions tagged [selection-bias]
5 questions
6
votes
0 answers
$\frac{P(x_1 \mid y, s = 1) \dots P(x_n \mid y, s = 1) P(y \mid s = 1)}{P(x \mid s = 1)}$ indicates that naive Bayes learners are global learners?
I am currently studying the paper Learning and Evaluating Classifiers under Sample Selection Bias by Bianca Zadrozny. In section 3. Learning under sample selection bias, the author says the following:
We can separate classifier learners into two…

The Pointer
- 527
- 3
- 17
3
votes
1 answer
How do I select the (number of) negative cases, if I'm given a set of positive cases?
We were given a list of labeled data (around 100) of known positive cases, i.e. people that have a certain disease, i.e. all these people are labeled with the same class (disease). We also have a much larger amount of data that we can label as…

Otto
- 33
- 5
3
votes
1 answer
What is meant by "the number of examples is reduced", and why is this the case?
I am currently studying the paper Learning and Evaluating Classifiers under Sample Selection Bias by Bianca Zadrozny. In section 3.2. Logistic Regression, the author says the following:
3.2. Logistic regression
In logistic regression, we use…

The Pointer
- 527
- 3
- 17
3
votes
1 answer
Is this referring to the true underlying distribution, or the distribution of our sample?
I am currently studying the paper Learning and Evaluating Classifiers under Sample Selection Bias by Bianca Zadrozny. In the introduction, the author says the following:
One of the most common assumptions in the design of learning algorithms is…

The Pointer
- 527
- 3
- 17
2
votes
0 answers
How does the support vector machine constraint imply that sample selection bias will not systematically affect the output of the optimisation?
I am currently studying the paper Learning and Evaluating Classifiers under Sample Selection Bias by Bianca Zadrozny. In section 3.4. Support vector machines, the author says the following:
3.4. Support vector machines
In its basic form, the…

The Pointer
- 527
- 3
- 17