D. Bahdanau, K. Cho, and Y. Bengio. Neural machine translation by jointly learning to align and translate.
CoRR, abs/1409.0473, 2014.
O. Bajgar, R. Kadlec, and J. Kleindienst. Embracing data abundance: Booktest dataset for reading comprehension.
CoRR, abs/1610.00956, 2016.
Y. Bengio, P. Simard, and P. Frasconi. Learning long-term dependencies with gradient descent is difficult. IEEE
transactions on neural networks, 5(2):157–166, 1994.
E. Birman-Deych, A. D. Waterman, Y. Yan, D. S. Nilasena, M. J. Radford, and B. F. Gage. Accuracy of icd-9-cm
codes for identifying cardiovascular and stroke risk factors. Medical care, 43(5):480–485, 2005.
K. Cho, B. van Merrienboer, D. Bahdanau, and Y. Bengio. On the properties of neural machine translation:
Encoder-decoder approaches. In EMNLP 2014, Eighth Workshop on Syntax, Semantics and Structure in
Statistical Translation, Doha, Qatar, 25 October 2014, pages 103–111, 2014a.
K. Cho, B. van Merriënboer, Ç. Gülçehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio. Learning
phrase representations using rnn encoder–decoder for statistical machine translation. In Proceedings of the
2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Oct. 2014b.
J. L. Elman. Finding structure in time. Cognitive Science, 14(2):179–211, 1990.
A. Graves. Supervised Sequence Labelling with Recurrent Neural Networks, volume 385 of Studies in Computa-
tional Intelligence. Springer, 2012.
N. R. Greenbaum, Y. Jernite, Y. Halpern, S. Calder, L. A. Nathanson, D. A. Sontag, and S. Horng. Contextual
autocomplete: A novel user interface using machine learning to improve ontology usage and structured data
capture for presenting problems in the emergency department. bioRxiv, page 127092, 2017.
M. Henaff, J. Weston, A. Szlam, A. Bordes, and Y. LeCun. Tracking the world state with recurrent entity
networks. arXiv preprint arXiv:1612.03969, 2016.
S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural computation, 9(8):1735–1780, 1997.
D. C. Hsia, W. M. Krushat, A. B. Fagan, J. A. Tebbutt, and R. P. Kusserow. Accuracy of diagnostic coding for
medicare patients under the prospective-payment system. New England Journal of Medicine, 318(6):352–355,
1988.
Y. Jernite, Y. Halpern, S. Horng, and D. Sontag. Predicting chief complaints at triage time in the emergency
department. In NIPS 2013 Workshop on Machine Learning for Clinical Data Analysis and Healthcare, 2013.
A. E. Johnson, T. J. Pollard, L. Shen, L.-w. H. Lehman, M. Feng, M. Ghassemi, B. Moody, P. Szolovits, L. A.
Celi, and R. G. Mark. MIMIC-III, a freely accessible critical care database. Scientific data, 3, 2016.
S. Joshi, S. Gunasekar, D. Sontag, and J. Ghosh. Identifiable phenotyping using constrained non–negative matrix
factorization. arXiv preprint arXiv:1608.00704, 2016.
G. Lample, M. Ballesteros, S. Subramanian, K. Kawakami, and C. Dyer. Neural architectures for named entity
recognition. arXiv preprint arXiv:1603.01360, 2016.
T. Lei, R. Barzilay, and T. Jaakkola. Rationalizing neural predictions. arXiv preprint arXiv:1606.04155, 2016.
L. V. Lita, S. Yu, R. S. Niculescu, and J. Bi. Large scale diagnostic code classification for medical patient records.
In IJCNLP, pages 877–882. Citeseer, 2008.
T. Mikolov, M. Karafiát, L. Burget, J. Cernocký, and S. Khudanpur. Recurrent neural network based language
model. In INTERSPEECH 2010, Makuhari, Chiba, Japan, September 2010, pages 1045–1048, 2010.