Programming for Corpus Linguistics with Python and Dataframes

Coding mit KI für Dummies

Encyclopedia of Cryptography, Security and Privacy, 3rd Edition

Торий, его сырьевые ресурсы, химия и технология

Современные системы безопасности - Антитеррор

The Outdoor Gas Griddle Cookbook: Master Your Flat Top Grill with Ultimate Griddling Cookbook

Алгоритмы и модели вычисления (2019)

Программирование в Clojure: Практика применения Lisp в мире Java (2013)

Культура русских поморов (2005)

Украинский легион

Схемы лечения выпадения волос

The Heart Healthy Plant-Based Cookbook

Ленивая выпечка. Сладкий выпуск

Hitler’s Hunting Squad in Southern Europe: The Bloody Path of Fritz Schubert through Occupied Crete and Macedonia

Dive!: Australian Submariners at War

Type 2 Diabetes Cookbook for Beginners

Clean Code With Java: Best practices 101

Coding mit KI für Dummies

Encyclopedia of Cryptography, Security and Privacy, 3rd Edition

Торий, его сырьевые ресурсы, химия и технология

Современные системы безопасности - Антитеррор

The Outdoor Gas Griddle Cookbook: Master Your Flat Top Grill with Ultimate Griddling Cookbook

Алгоритмы и модели вычисления (2019)

Программирование в Clojure: Практика применения Lisp в мире Java (2013)

Культура русских поморов (2005)

Украинский легион

Схемы лечения выпадения волос

The Heart Healthy Plant-Based Cookbook

Ленивая выпечка. Сладкий выпуск

Hitler’s Hunting Squad in Southern Europe: The Bloody Path of Fritz Schubert through Occupied Crete and Macedonia

Dive!: Australian Submariners at War

Type 2 Diabetes Cookbook for Beginners

Clean Code With Java: Best practices 101

Категория: КНИГИ » ПРОГРАММИРОВАНИЕ

Programming for Corpus Linguistics with Python and Dataframes

Добавил: literator / 26-05-2024, 14:38 /

Название: Programming for Corpus Linguistics with Python and Dataframes
Автор: Daniel Keller
Издательство: Cambridge University Press
Год: 2024
Страниц: 114
Язык: английский
Формат: pdf (true), epub
Размер: 10.1 MB

This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.

Programming often involves manipulating data. In CL, our data are samples of language, and our operations are things like counting word types, calculating association strength, measuring dispersion, and so on. To accomplish these things, we need to be able to hold and reference data in a computer’s memory, often in discrete chunks. We do this with variables. To perform operations on these variables, we write instructions (code) that the Python interpreter understands how to carry out. We can group sets of instructions and save them to be reused later. These are called functions. Often, we will use functions written by other people to save time and guarantee replicability.

This section introduces Pandas DataFrame and Series classes, methods for loading and saving them to disk, and methods and functions for counting values, grouping rows, and combining values. These form a core set of tools that can be used to accomplish a range of CL tasks. The focus in this section is on explaining these elements generally, while Section 4 describes algorithms that use these procedures to complete CL analyses speciﬁcally. We will use two data types extensively in this element, DataFrames and Series. These are not core data types in Python and must be imported through the Pandas package. However, once imported, we will be able to leverage the powerful methods built into them to do corpus linguistic tasks quickly, reliably, and with minimal hardware resources.

Скачать Programming for Corpus Linguistics with Python and Dataframes

Скачать с Turbobit

ОТСУТСТВУЕТ ССЫЛКА/ НЕ РАБОЧАЯ ССЫЛКА ЕСТЬ РЕШЕНИЕ, ПИШИМ СЮДА!

Нашел ошибку? Есть жалоба? Жми!
Пожаловаться администрации

[related-news]

[/related-news]

Комментарии 0

Комментариев пока нет. Стань первым!

Похожие публикации