Update README.md
Browse files
README.md
CHANGED
|
@@ -74,10 +74,21 @@ for i, (answer, prob) in enumerate(results):
|
|
| 74 |
---
|
| 75 |
|
| 76 |
# Metrics
|
| 77 |
-
|
| 78 |
-
|
| 79 |
-
|
| 80 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 81 |
---
|
| 82 |
|
| 83 |
# Limitations
|
|
|
|
| 74 |
---
|
| 75 |
|
| 76 |
# Metrics
|
| 77 |
+
Evaluation results using the official evaluation script on SQuAD 2.0 validation set:
|
| 78 |
+
|
| 79 |
+
```json
|
| 80 |
+
{
|
| 81 |
+
"exact": 80.29141750189505,
|
| 82 |
+
"f1": 83.22890970115323,
|
| 83 |
+
"total": 11873,
|
| 84 |
+
"HasAns_exact": 72.08164642375169,
|
| 85 |
+
"HasAns_f1": 77.96505480462089,
|
| 86 |
+
"HasAns_total": 5928,
|
| 87 |
+
"NoAns_exact": 88.47771236333053,
|
| 88 |
+
"NoAns_f1": 88.47771236333053,
|
| 89 |
+
"NoAns_total": 5945
|
| 90 |
+
}
|
| 91 |
+
```
|
| 92 |
---
|
| 93 |
|
| 94 |
# Limitations
|