Best Best AI Agent Skills for Data Analysis
Skills for data wrangling, visualization, statistical analysis, pandas workflows, and turning raw datasets into actionable insights.
The xlsx skill provides guidelines and requirements for creating, editing, and analyzing Excel files, ensuring zero formula errors and following established formatting standards for various types of financial models.
npx skills add https://github.com/anthropics/skills --skill xlsxTransform slow database queries into lightning-fast operations through systematic optimization, proper indexing, and query plan analysis.
npx skills add https://github.com/wshobson/agents --skill sql-optimization-patternsMaster Retrieval-Augmented Generation (RAG) to build LLM applications that provide accurate, grounded responses using external knowledge sources.
npx skills add https://github.com/wshobson/agents --skill rag-implementationComprehensive patterns for designing effective Key Performance Indicator (KPI) dashboards that drive business decisions.
npx skills add https://github.com/wshobson/agents --skill kpi-dashboard-designInteract with Google NotebookLM to query documentation with Gemini's source-grounded answers. Each question opens a fresh browser session, retrieves the answer exclusively from uploaded documents, and closes.
npx skills add https://github.com/pleaseprompto/notebooklm-skill --skill notebooklmMaster comprehensive evaluation strategies for LLM applications, from automated metrics to human evaluation and A/B testing.
npx skills add https://github.com/wshobson/agents --skill llm-evaluationPractical implementation guide for GDPR-compliant data processing, consent management, and privacy controls.
npx skills add https://github.com/wshobson/agents --skill gdpr-data-handlingBuilds and validates trading strategy backtests using robust methodologies to avoid biases and errors. Developers gain increased confidence in their strategy's reliability through structured tests and comprehensive analysis techniques.
npx skills add https://github.com/wshobson/agents --skill backtesting-frameworksImplements efficient similarity search workflows for various applications, including semantic search systems and recommendation engines. By utilizing this skill, developers can significantly improve search accuracy and performance, ensuring quick retrieval of similar items even from large datasets.
npx skills add https://github.com/wshobson/agents --skill similarity-search-patternsFacilitates web data extraction by rendering JavaScript and bypassing anti-bot measures to capture dynamic content. Automates the process of gathering structured data from websites, saving developers time and reducing the likelihood of errors in data collection.
npx skills add https://github.com/jezweb/claude-skills --skill firecrawl-scraperMarkItDown is a Python tool developed by Microsoft for converting various file formats to Markdown, particularly useful for converting documents into LLM-friendly text format.
npx skills add https://github.com/davila7/claude-code-templates --skill markitdownFacilitates the creation of publication-ready scientific figures by transforming data into visually appealing visualizations. Developers benefit from streamlined workflows that enhance figure quality, ensuring compliance with journal standards and accessibility for colorblind readers.
npx skills add https://github.com/davila7/claude-code-templates --skill scientific-visualizationConducts detailed exploratory data analysis on various scientific file formats to identify data quality and key insights. This automation helps researchers by providing comprehensive markdown reports, facilitating faster decision-making in their analyses.
npx skills add https://github.com/davila7/claude-code-templates --skill exploratory-data-analysisEnables semantic search and content extraction from various web pages based on user intent. Speeds up research and information retrieval by allowing users to automate the search process and get structured responses.
npx skills add https://github.com/benedictking/exa-search --skill exa-searchHandles the creation of various types of visualizations in Python, ranging from simple line charts to complex 3D plots. Enhances productivity by enabling the development of high-quality graphics, which effectively communicates data insights through attractive and customizable plots.
npx skills add https://github.com/davila7/claude-code-templates --skill matplotlibFacilitates statistical hypothesis testing through methods such as t-tests, ANOVA, and regression analysis. By providing automated assumptions checking and APA-style reporting, it enhances the reliability and efficiency of research data analysis.
npx skills add https://github.com/davila7/claude-code-templates --skill statistical-analysisHandles the creation of machine learning models for classification, regression, and clustering tasks using scikit-learn. By installing this skill, users can streamline the development and evaluation of machine learning workflows, leading to faster iterations and more reliable model performance.
npx skills add https://github.com/davila7/claude-code-templates --skill scikit-learnGenerates detailed market research reports exceeding 50 pages that analyze market dynamics and competitive landscapes. By automating report creation, it significantly reduces the time and effort required, enhancing decision-making processes for business strategies.
npx skills add https://github.com/davila7/claude-code-templates --skill market-research-reportsFacilitates efficient data manipulation and processing through a powerful DataFrame library. Users benefit by improving performance and reducing execution time for large datasets via lazy evaluation and optimized queries.
npx skills add https://github.com/davila7/claude-code-templates --skill polarsGenerates high-quality statistical visualizations directly from DataFrames in Python projects. This skill enhances data analysis productivity by simplifying complex graph creation, allowing users to visualize relationships and distributions with minimal code.
npx skills add https://github.com/davila7/claude-code-templates --skill seabornHandles the task of generating text embeddings using the Google Gemini embeddings API. By using this skill, developers can efficiently translate text into vector representations, enabling advanced applications like semantic search and document clustering.
npx skills add https://github.com/jezweb/claude-skills --skill google-gemini-embeddingsFacilitates the implementation of various machine learning models for tasks such as text generation, classification, and fine-tuning. By utilizing this skill, developers can expedite model deployment and enhance their applications without extensive manual setup.
uv pip install torch transformers datasets evaluate accelerateHandles document uploads and natural language queries in a managed RAG system, leveraging semantic search for over 100 document formats. Automates document retrieval and citation extraction, significantly enhancing search efficiency and accuracy for users.
npx skills add https://github.com/jezweb/claude-skills --skill google-gemini-file-searchFacilitates various statistical modeling processes, including regression analysis, time series forecasting, and hypothesis testing. Users gain accuracy and efficiency in their analyses by automating complex statistical methods, thus enhancing decision-making capabilities.
npx skills add https://github.com/davila7/claude-code-templates --skill statsmodelsComputes SHAP values for any machine learning model to explain predictions. This skill enhances model interpretability and trust by providing insights into feature importance and model behavior, ultimately aiding in better decision-making.
npx skills add https://github.com/davila7/claude-code-templates --skill shapFacilitates advanced machine learning tasks with time series data, such as classification, regression, and anomaly detection. By utilizing this skill, developers can enhance their predictive analytics capabilities and improve decision-making with accurate forecasts and insights.
uv pip install aeonHandles the creation of interactive visualizations using Python by providing over 40 chart types and APIs. Developers benefit from streamlined plotting capabilities that reduce the complexity of visual data representation and enhance data analysis efficiency.
npx skills add https://github.com/davila7/claude-code-templates --skill plotlyHandles single-cell genomic data analysis, including RNA-seq and chromatin accessibility. By installing this skill, developers can leverage advanced probabilistic models to automate complex analyses, accelerating research and reducing human error.
npx skills add https://github.com/davila7/claude-code-templates --skill scvi-toolsFacilitates programmatic access to extensive DrugBank data for drug research and analysis. Developers can efficiently retrieve detailed drug and interaction information, enhancing pharmacological understanding and data handling capabilities.
npx skills add https://github.com/davila7/claude-code-templates --skill drugbank-databaseHandles reading, writing, and manipulating DICOM files used in medical imaging, including pixel data and metadata management. By installing this skill, developers can automate the workflow for processing medical images, reducing errors and improving efficiency in handling complex DICOM datasets.
npx skills add https://github.com/davila7/claude-code-templates --skill pydicomRetrieves AI-predicted 3D protein structures for over 200 million proteins. Enhances research efficiency by automating data retrieval and integration into computational workflows.
npx skills add https://github.com/davila7/claude-code-templates --skill alphafold-databaseFacilitates molecular analysis and manipulation through a comprehensive Python API in cheminformatics projects. By automating tasks such as molecular structure reading and descriptor calculation, it enhances efficiency and reduces potential errors in chemical research.
npx skills add https://github.com/davila7/claude-code-templates --skill rdkitFacilitates comprehensive analysis of single-cell RNA-seq data by providing essential preprocessing and visualization tools. Users benefit from streamlined workflows that enhance data insight and enable accurate biological interpretations without extensive manual work.
npx skills add https://github.com/davila7/claude-code-templates --skill scanpyExecutes Python code in a controlled, sandboxed environment with access to numerous pre-installed libraries. This allows developers to run scripts safely and efficiently, reducing the risk of code errors and enhancing productivity.
npx skills add https://github.com/inference-sh/skills --skill python-executorFacilitates the retrieval of curated SNP-trait associations from the GWAS Catalog database. Enhances research efficiency by providing immediate access to comprehensive genetic variant data and associated statistics, streamlining investigations into genetic associations and disease traits.
npx skills add https://github.com/davila7/claude-code-templates --skill gwas-databaseHandles astronomical data analysis tasks such as unit conversions and coordinate transformations. By installing it, developers can streamline their data processing workflows, ensuring accuracy in calculations and saving time on manual conversions.
npx skills add https://github.com/davila7/claude-code-templates --skill astropyFacilitates access to genetic variant data from ClinVar for research and clinical interpretation. By employing this skill, users can effectively analyze genomic variations, saving time and enhancing the accuracy of genetic assessments.
npx skills add https://github.com/davila7/claude-code-templates --skill clinvar-databaseHandles the deployment and management of Streamlit applications directly within the Snowflake environment. This skill allows developers to create data applications more efficiently by eliminating external hosting and integrating seamlessly with Snowflake's data infrastructure.
npx skills add https://github.com/jezweb/claude-skills --skill streamlit-snowflakeFacilitates the querying and analysis of extensive single-cell genomics data from the CZ CELLxGENE repository. It accelerates research by automating data access and processing, enabling users to focus on insights rather than data management.
npx skills add https://github.com/davila7/claude-code-templates --skill cellxgene-censusFacilitates access to extensive FDA regulatory data regarding drugs, medical devices, and related safety information. Enables researchers to effectively monitor and analyze regulatory actions, improving safety assessment and compliance efforts.
npx skills add https://github.com/davila7/claude-code-templates --skill fda-databaseRetrieves and analyzes protein-protein interaction networks to understand biological relationships. By using this skill, developers can efficiently explore large datasets and validate structural biology hypotheses, streamlining the process of discovering functional relationships among proteins.
npx skills add https://github.com/davila7/claude-code-templates --skill string-databaseFacilitates comprehensive patent and trademark searches via specialized APIs for IP analysis. Enhances research efficiency by automating data retrieval, enabling users to focus on strategic insights rather than manual querying.
npx skills add https://github.com/davila7/claude-code-templates --skill uspto-databaseAutomates the generation and testing of scientific hypotheses using language models, allowing researchers to systematically explore data-driven insights. By streamlining hypothesis creation, it reduces the time needed for research development and enhances the rigor of scientific investigation.
uv pip install hypogenicFacilitates querying and accessing statistical data through the Data Commons Python API. Users can efficiently retrieve various statistical observations, allowing for improved data analysis and informed decision-making.
pip install "datacommons-client[Pandas]"Facilitates access to AI-ready datasets and benchmarks specifically designed for drug discovery and development. By using this skill, practitioners streamline data handling and model benchmarking across various therapeutic tasks, enhancing their research efficiency.
npx skills add https://github.com/davila7/claude-code-templates --skill pytdcFacilitates the retrieval and management of nucleotide sequence data from the European Nucleotide Archive (ENA). By using this skill, developers can efficiently integrate genomic data into bioinformatics pipelines, automating data access and reducing the time spent on manual data handling.
npx skills add https://github.com/davila7/claude-code-templates --skill ena-databaseAutomates the end-to-end research process, facilitating tasks like hypothesis generation and paper writing. By streamlining scientific workflows, it significantly reduces the time and effort required to move from data analysis to publication-ready manuscripts.
npx skills add https://github.com/davila7/claude-code-templates --skill denarioHandles querying metabolite data and accessing comprehensive metabolomics study information. By using this skill, developers streamline research processes and eliminate errors, thereby enhancing the efficiency of metabolomics investigations.
npx skills add https://github.com/davila7/claude-code-templates --skill metabolomics-workbench-databaseFacilitates access to comprehensive structural data of biological macromolecules, enabling users to search and retrieve detailed coordinates and metadata for various structures. Users benefit from streamlined workflows in drug discovery and structural biology research by automating data retrieval and analysis processes.
npx skills add https://github.com/davila7/claude-code-templates --skill pdb-databaseHandles the creation, manipulation, and storage of annotated data matrices, specifically tailored for genomic analysis tasks involving large datasets. Developers gain efficient data management capabilities, allowing for seamless integration and analysis of single-cell RNA-sequencing data, ultimately enhancing productivity.
npx skills add https://github.com/davila7/claude-code-templates --skill anndata