explosion/spaCy: v2.2.2: Multiprocessing, future APIs, Luxembourgish base support & simpler GPU install ...

✨ New features and improvements NEW: Support multiprocessing in nlp.pipe via the n_process argument (Python 3 only). Base language support for Luxembourgish. Add noun chunks iterator for Swedish. Retrained models for Greek, Norwegian Bokmål and Lithuanian that now correctly support parser-based sentence segmentation. Repackaged models for Greek and German with improved lookup tables via spacy-lookups-data . Add warning in debug-data for low sentences per doc ratio. Improve checks and errors related to ill-formed IOB input in convert and debug-data CLI. Support training dict format as JSONL. Ma... Mehr ...

Verfasser: Honnibal, Matthew
Montani, Ines
Landeghem, Sofie Van
Peters, Henning
Samsonov, Maxim
Geovedi, Jim",Adrianeboyd,"Regan, Jim
Orosz, György
McCann, Paul O'Leary
Kristiansen, Søren Lind
Altinok, Duygu
, Roman
Howard, Grégory
Wannaphong Phatthiyaphaibun
Bozek, Sam
Explosion Bot
Böing, Björn
Amery, Mark
Vogelsang, Leif Uwe
Tippa, Pradeep Kumar",Jeannefukumaru,GregDubbin,"Mazaev, Vadim
Balakrishnan, Ramanan
Møllerhøj, Jens Dahl",Wbwseeker,"Burton, Magnus
Avadh Patel
Dokumenttyp: SoftwareSourceCode
Erscheinungsdatum: 2019
Verlag/Hrsg.: Zenodo
Sprache: unknown
Permalink: https://search.fid-benelux.de/Record/base-29103624
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://dx.doi.org/10.5281/zenodo.3524402

✨ New features and improvements NEW: Support multiprocessing in nlp.pipe via the n_process argument (Python 3 only). Base language support for Luxembourgish. Add noun chunks iterator for Swedish. Retrained models for Greek, Norwegian Bokmål and Lithuanian that now correctly support parser-based sentence segmentation. Repackaged models for Greek and German with improved lookup tables via spacy-lookups-data . Add warning in debug-data for low sentences per doc ratio. Improve checks and errors related to ill-formed IOB input in convert and debug-data CLI. Support training dict format as JSONL. Make EntityRuler ID resolution 2× faster and support "id" in patterns to set Token.ent_id . Improve rendering of named entity spans in displacy for RTL languages. Update Thinc to ditch thinc_gpu_ops for simpler GPU install. Support Mish activation in spacy pretrain . Add backwards-compatible support for new Language.disable_pipes API, which will become the default in the future. The method can now also take a list of ...