** AWS & AI Research Specialist | Principal Machine Learning Engineer & Enterprise Architect | IIM-A | Author | Generative AI | RAG | Transformers 🤗 Open-Source Contributor | Computer Vision | Quantum AI | 8+ Years of Experience in Fortune 50 Product Companies **
** Honoured to contribute to the book "Generative AI Application Integration Patterns", published by Packt **
"Generative AI Application Integration Patterns" @Packt is available on Amazon. Grab your copy now!
Book Overview
Unleash the transformative potential of GenAI with this comprehensive guide that serves as an indispensable roadmap for integrating large language models into real-world applications. Gain invaluable insights into identifying compelling use cases, leveraging state-of-the-art models effectively, deploying these models into your applications at scale, and navigating ethical considerations.
Key Features
Get familiar with the most important tools and concepts used in real scenarios to design GenAI apps
Interact with GenAI models to tailor model behavior to minimize hallucinations
Get acquainted with a variety of strategies and an easy-to-follow four-step framework for integrating GenAI into applications.
Thank you so much for visiting my profile!
** Global Top AI Community Member @Landing AI, @MLOPS Community, @Pandas AI, @Full Stack Deep Learning, @HumaneAI, @H2o.ai Generative AI, @Modular, @Cohere AI, @Hugging Face Research Papers Group, @Papers with Code ** ** Completed 100+ online technical paid courses from Udemy & Coursera, as I believe in continuous learning and a growth mindset **
** AWS & AI Research Specialist | Full Stack Applied AI Research Scientist & Enterprise Architect | AI Search & Developer Platforms Tech Stack **
## ** Developer Tools ** ##
** AI Research Junction @Aditi Khare - Research Paper Summaries @Gen AI @Computer Vision @Quantum AI **
1. Practical application of quantum neural network to materials informatics
This paper constructs a QNN model to predict the melting points of metal oxides, as an example of a multivariate regression task for the materials informatics (MI) problem; different architectures (encoding methods and entangler arrangements) are explored to create an effective QNN model (a toy sketch follows this list).
2. BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading
This paper proposes a batch RL algorithm that uses variational quantum circuits (VQCs) as function approximators within the discrete batch-constrained deep Q-learning (BCQ) algorithm, and introduces a novel data re-uploading scheme that cyclically shifts the order of input variables in the data-encoding layers (see the second sketch after this list); the algorithm is evaluated on the OpenAI CartPole environment and compared to a classical neural-network-based discrete BCQ.
3. Quantum Algorithms: A New Frontier in Financial Crime Prevention
This Paper showcases advanced methodologies such as Quantum Machine Learning (QML) and Quantum Artificial Intelligence (QAI) as powerful solutions for detecting and preventing financial crimes, including money laundering, financial crime detection, cryptocurrency attacks, and market manipulation.
4. Satellite-based entanglement distribution and quantum teleportation with continuous variables
This paper studies the effects of atmospheric turbulence on continuous-variable entanglement distribution and quantum teleportation in the optical regime between a ground station and a satellite.
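As a concrete illustration of the QNN regression setup in paper 1, here is a minimal NumPy statevector sketch: two features are angle-encoded with RY rotations, a CNOT acts as the entangler, and the expectation of Z on the first qubit is the regression output. This is my simplification, not the paper's circuit; the paper explores multiple encoding methods and entangler arrangements.

```python
import numpy as np

# Toy two-qubit QNN regressor (illustrative only): angle-encode two normalized
# material descriptors with RY rotations, entangle with a CNOT, apply one
# trainable RY layer, and read out <Z> on qubit 0 as the predicted value.

def ry(theta):
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s], [s, c]])

CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]], dtype=float)

def qnn_predict(x, params):
    state = np.zeros(4); state[0] = 1.0                     # start in |00>
    state = np.kron(ry(x[0]), ry(x[1])) @ state             # data encoding
    state = CNOT @ state                                    # entangler
    state = np.kron(ry(params[0]), ry(params[1])) @ state   # trainable layer
    z0 = np.kron(np.diag([1.0, -1.0]), np.eye(2))           # Z on qubit 0
    return state @ z0 @ state                               # expectation in [-1, 1]

# e.g. a (rescaled) melting-point prediction from two hypothetical descriptors
print(qnn_predict([0.3, 1.1], [0.5, -0.2]))
```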
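And for paper 2's cyclic data re-uploading, a minimal sketch of the encoding schedule as I read it: every encoding layer re-uploads the same features, but their assignment to qubits is cyclically shifted by the layer index (function and variable names are mine).

```python
# Cyclic data re-uploading sketch: layer L encodes the features shifted by L,
# so each qubit sees every input variable across the encoding layers.
def cyclic_encoding_schedule(features, n_layers):
    n = len(features)
    return [[features[(qubit + layer) % n] for qubit in range(n)]
            for layer in range(n_layers)]

# layer 0 -> [x0, x1, x2, x3], layer 1 -> [x1, x2, x3, x0], ...
print(cyclic_encoding_schedule(["x0", "x1", "x2", "x3"], n_layers=3))
```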
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
Introduces an efficient method to scale Transformer-based large language models (LLMs) to infinitely long inputs with bounded memory and computation; the key component is a new attention technique dubbed Infini-attention, which combines a compressive memory and local attention within a single Transformer block.
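A rough NumPy sketch of the mechanism, under my reading of the paper: each segment combines standard dot-product attention with a retrieval from a compressive memory that accumulates linear-attention statistics across segments. The ELU+1 nonlinearity, the scalar gate, and the shapes below are simplifications, and the causal mask is omitted.

```python
import numpy as np

def elu1(x):  # ELU + 1 keeps queries/keys positive for the linear-attention memory
    return np.where(x > 0, x + 1.0, np.exp(x))

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

d, seg = 64, 128
M = np.zeros((d, d))   # compressive memory, fixed size regardless of input length
z = np.zeros(d)        # normalization statistics
beta = 0.0             # learned gate (a scalar here for simplicity)

def infini_attention_segment(Q, K, V):
    global M, z
    sQ = elu1(Q)
    A_mem = (sQ @ M) / (sQ @ z + 1e-6)[:, None]          # retrieve from memory
    A_local = softmax(Q @ K.T / np.sqrt(d)) @ V          # in-segment attention
    sK = elu1(K)
    M = M + sK.T @ V                                      # update memory
    z = z + sK.sum(axis=0)
    g = 1.0 / (1.0 + np.exp(-beta))                       # gate the two streams
    return g * A_mem + (1.0 - g) * A_local

rng = np.random.default_rng(0)
for _ in range(3):  # stream over segments with bounded memory and compute
    Q, K, V = (rng.normal(size=(seg, d)) for _ in range(3))
    out = infini_attention_segment(Q, K, V)
print(out.shape)  # (128, 64)
```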
CodecLM - A general framework for adaptively generating high-quality synthetic data for LLM alignment, tailored to different downstream instruction distributions and LLMs.
Surveys multilingual large language models, which use powerful LLMs to handle and respond to queries in multiple languages and have achieved remarkable success in multilingual natural language processing tasks; despite these breakthroughs, a comprehensive survey summarizing existing approaches had been lacking.
DBRX - Advances the state of the art in efficiency among open models thanks to its fine-grained mixture-of-experts (MoE) architecture; this state-of-the-art quality comes with marked improvements in training and inference performance.
VAST AI releases the Triplane Meets Gaussian Splatting demo on Hugging Face - Fast and Generalizable Single-View 3D Reconstruction with Transformers - https://arxiv.org/abs/2312.09147
Advancing Long-Context LLMs - Overview of the methodologies for enhancing Transformer architecture modules that optimize long-context capabilities across all stages from pre-training to inference - https://arxiv.org/abs/2311.12351
Chain-of-Thought Reasoning to Language Agents - Summary of CoT reasoning, the foundational mechanics underpinning CoT techniques, and their application to language agent frameworks - https://arxiv.org/abs/2311.11797
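For concreteness, the kind of zero-shot CoT prompt these surveys build on (the wording is mine, and `llm` stands in for any text-completion API):

```python
# Minimal chain-of-thought prompt: ask the model to reason step by step
# before committing to an answer, then read the final line.
question = "A train travels 60 km in 45 minutes. What is its speed in km/h?"
prompt = (
    f"Q: {question}\n"
    "A: Let's think step by step, then give the final answer on its own line "
    "prefixed with 'Answer:'."
)
# response = llm(prompt)  # placeholder: any text-completion API
```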
MART - Multi-round Automatic Red-Teaming for improving LLM safety - https://arxiv.org/abs/2311.07689
LLMs can Deceive Users - Explores the use of an autonomous stock-trading agent powered by LLMs; finds that the agent acts on insider tips and hides the reason behind its trading decision; shows that helpful and safe LLMs can strategically deceive users in a realistic situation without direct instructions or training for deception - https://arxiv.org/abs/2311.07590
GPT4All-Outlines technical details of the GPT4All model family along with the open-source repository that aims to democratize access to LLMs-https://arxiv.org/abs/2311.04931.
Chain-of-Verification Reduces Hallucination in LLMs - Develops a method to enable LLMs to "deliberate" on responses to correct mistakes; it includes the following steps: 1) draft an initial response, 2) plan verification questions to fact-check the draft, 3) answer the questions independently, and 4) generate a final verified response.
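A hedged sketch of that four-step loop with a placeholder `llm(prompt)` completion function; the prompt wording and helper names are mine, not the paper's.

```python
# Chain-of-Verification sketch: draft -> plan checks -> answer checks -> revise.
def chain_of_verification(question, llm):
    draft = llm(f"Answer the question.\nQ: {question}\nA:")
    plan = llm("List short fact-checking questions for this draft answer, "
               f"one per line.\nQ: {question}\nDraft: {draft}")
    checks = [(q, llm(f"Answer concisely and independently: {q}"))
              for q in plan.splitlines() if q.strip()]
    findings = "\n".join(f"{q} -> {a}" for q, a in checks)
    return llm("Revise the draft so it is consistent with the verification "
               f"answers.\nQ: {question}\nDraft: {draft}\n"
               f"Verification:\n{findings}\nFinal answer:")

# usage: final = chain_of_verification("Who wrote 'Middlemarch'?", llm=my_model)
```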
LongLoRA - Efficient fine-tuning approach to significantly extend the context windows of pre-trained LLMs; implements shift short attention, a substitute that approximates the standard self-attention pattern during training; it has lower GPU memory cost and training time compared to full fine-tuning while not compromising accuracy - https://arxiv.org/abs/2309.12307
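A NumPy sketch of my reading of shift short attention: tokens attend within fixed-size groups, and half the attention heads are shifted by half the group size so information flows between adjacent groups. The paper additionally applies causal masking and uses this pattern only during fine-tuning, so treat this as a sketch.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def s2_attn(q, k, v, group):
    # q, k, v: (heads, seq, dim). Attention runs within token groups, and half
    # the heads are shifted by group//2 so information crosses group borders.
    h, n, d = q.shape
    shift, half = group // 2, h // 2
    q, k, v = q.copy(), k.copy(), v.copy()
    for t in (q, k, v):
        t[half:] = np.roll(t[half:], -shift, axis=1)     # shift half the heads
    out = np.empty_like(q)
    for s in range(0, n, group):                          # blockwise attention
        qs, ks, vs = q[:, s:s+group], k[:, s:s+group], v[:, s:s+group]
        out[:, s:s+group] = softmax(qs @ ks.transpose(0, 2, 1) / np.sqrt(d)) @ vs
    out[half:] = np.roll(out[half:], shift, axis=1)       # undo the shift
    return out

rng = np.random.default_rng(0)
q, k, v = (rng.normal(size=(8, 256, 32)) for _ in range(3))
print(s2_attn(q, k, v, group=64).shape)  # (8, 256, 32)
```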
Textbooks Are All You Need II - New 1.3-billion-parameter model trained on 30 billion tokens; the dataset consists of "textbook-quality" synthetically generated data; phi-1.5 competes with or outperforms other larger models on reasoning tasks, suggesting that data quality plays a more important role than previously thought - https://arxiv.org/abs/2309.05463
The Rise and Potential of LLM Based Agents - A comprehensive overview of LLM-based agents; covers everything from how to construct these agents to how to harness them for good - https://arxiv.org/abs/2309.07864
** AUG 2023 **
Open Problems and Limitations of RLHF - Provides an overview of open problems and the limitations of RLHF - https://arxiv.org/abs/2307.15217
Skeleton-of-Thought - Proposes a prompting strategy that first generates an answer skeleton and then performs parallel API calls to generate the content of each skeleton point; reports quality improvements in addition to a speed-up of up to 2.39x - https://arxiv.org/abs/2307.15337
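A minimal sketch of that two-stage scheme with a placeholder `llm` completion function; thread-based parallelism stands in for parallel API calls, and all prompt wording is mine.

```python
from concurrent.futures import ThreadPoolExecutor

# Skeleton-of-Thought sketch: one call drafts a skeleton, then each point
# is expanded in parallel and the expansions are concatenated.
def skeleton_of_thought(question, llm, max_workers=8):
    skeleton = llm(f"Give a short numbered skeleton of an answer to: {question}")
    points = [p for p in skeleton.splitlines() if p.strip()]
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        bodies = list(pool.map(
            lambda p: llm(f"Question: {question}\n"
                          f"Expand this point in 2-3 sentences: {p}"),
            points))
    return "\n\n".join(bodies)
```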
MetaGPT - A framework involving LLM-based multi-agents that encodes human standardized operating procedures (SOPs) to extend complex problem-solving capabilities and mimic efficient human workflows; this enables MetaGPT to perform multifaceted software development, code generation tasks, and even data analysis using tools like AutoGPT and LangChain - https://arxiv.org/abs/2308.00352v2
OpenFlamingo - Introduces a family of autoregressive vision-language models ranging from 3B to 9B parameters; the technical report describes the models, training data, and evaluation suite-https://arxiv.org/abs/2308.01390.
** JULY 2023 **
Universal Adversarial LLM Attacks - Finds universal and transferable adversarial attacks that cause aligned models like ChatGPT and Bard to generate objectionable behaviors; the approach automatically produces adversarial suffixes using greedy and gradient search - https://arxiv.org/abs/2307.15043
A Survey on Evaluation of LLMs-Comprehensive overview of evaluation methods for LLMs focusing on what to evaluate, where to evaluate, and how to evaluate-https://arxiv.org/abs/2307.03109
How Language Models Use Long Contexts-Finds that LM performance is often highest when relevant information occurs at the beginning or end of the input context; performance degrades when relevant information is provided in the middle of a long context-https://arxiv.org/abs/2307.03172.
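A small mitigation sketch implied by this finding (mine, not from the paper): reorder retrieved documents so the highest-ranked ones sit at the beginning and end of the prompt rather than in the middle.

```python
# Place the strongest retrieved documents at the edges of the context and
# bury the weakest in the middle, where the finding says they matter least.
def edge_order(docs_ranked_best_first):
    front, back = [], []
    for i, doc in enumerate(docs_ranked_best_first):
        (front if i % 2 == 0 else back).append(doc)
    return front + back[::-1]   # best documents end up first and last

print(edge_order(["d1", "d2", "d3", "d4", "d5"]))  # ['d1', 'd3', 'd5', 'd4', 'd2']
```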
LLMs as Effective Text Rankers-Proposes a prompting technique that enables open-source LLMs to perform state-of-the-art text ranking on standard benchmarks- https://arxiv.org/abs/2306.17563
Multimodal Generation with Frozen LLMs - Introduces an approach that effectively maps images into the token space of LLMs, enabling frozen models like PaLM and GPT-4 to tackle visual and multimodal tasks via in-context learning, without any parameter updates - https://arxiv.org/abs/2306.17842
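A much-simplified sketch of the idea: quantize image features into a frozen LLM's token space by nearest-neighbor lookup against its token-embedding matrix. The paper learns this mapping; all shapes and names below are made up.

```python
import numpy as np

# Map image patch features to frozen-LLM token ids by cosine similarity
# against the token-embedding matrix (a crude stand-in for a learned mapper).
rng = np.random.default_rng(0)
vocab_emb = rng.normal(size=(32000, 768))   # frozen LLM token embeddings
img_feats = rng.normal(size=(16, 768))      # 16 image patch features

def to_tokens(feats, emb):
    f = feats / np.linalg.norm(feats, axis=1, keepdims=True)
    e = emb / np.linalg.norm(emb, axis=1, keepdims=True)
    return (f @ e.T).argmax(axis=1)          # one token id per patch

print(to_tokens(img_feats, vocab_emb))       # pseudo-"words" describing the image
```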
CodeGen2.5-New code LLM trained on 1.5T tokens; the 7B model is on par with >15B code-generation models and it’s optimized for fast sampling-https://arxiv.org/abs/2305.02309.
InterCode - Framework that casts interactive coding as a reinforcement learning environment, unlike typical coding benchmarks that assume a static sequence-to-sequence process - https://arxiv.org/abs/2306.14898
** JUNE 2023 **
LeanDojo - Open-source Lean playground consisting of toolkits, data, models, and benchmarks for theorem proving; also develops ReProver, a retrieval-augmented LLM-based prover that selects premises from a vast math library - https://arxiv.org/abs/2306.15626
Understanding Theory-of-Mind in LLMs with LLMs- Framework for procedurally generating evaluations with LLMs; proposes a benchmark to study the social reasoning capabilities of LLMs with LLMs. https://arxiv.org/abs/2306.15448.
Scaling MLPs: A Tale of Inductive Bias - Shows that the performance of MLPs improves with scale and highlights that a lack of inductive bias can be compensated for - https://arxiv.org/abs/2306.13575
RoboCat - New foundation agent that can operate different robotic arms and can solve tasks from as few as 100 demonstrations; the self-improving AI agent can self-generate new training data to improve its technique and get more efficient at adapting to new tasks - https://arxiv.org/abs/2306.11706
ClinicalGPT - Language model optimized through extensive and diverse medical data, including medical records, domain-specific knowledge, and multi-round dialogue consultations - https://arxiv.org/abs/2306.09968
An Overview of Catastrophic AI Risks - provides an overview of the main sources of catastrophic AI risks; the goal is to foster more understanding of these risks and ensure AI systems are developed in a safe manner-https://arxiv.org/abs/2306.12001v1.
AudioPaLM - Fuses text-based and speech-based LMs, PaLM-2 and AudioLM, into a multimodal architecture that supports speech understanding and generation; outperforms existing systems for speech translation tasks, with zero-shot speech-to-text translation capabilities - https://arxiv.org/abs/2306.12925v1
** MAY 2023 **
Gorilla-Finetuned LLaMA-based model that surpasses GPT-4 on writing API calls. This capability can help identify the right API, boosting the ability of LLMs to interact with external tools to complete specific tasks-https://arxiv.org/abs/2305.15334.
The False Promise of Imitating Proprietary LLMs - provides a critical analysis of models that are finetuned on the outputs of a stronger model; argues that model imitation is a false premise and that the higher leverage action to improve open source models is to develop better base models-https://arxiv.org/abs/2305.15717
InstructBLIP-Explores visual-language instruction tuning based on the pre-trained BLIP-2 models; achieves state-of-the-art zero-shot performance on 13 held-out datasets, outperforming BLIP-2 and Flamingo. https://arxiv.org/abs/2305.06500
Active Retrieval Augmented LLMs - Introduces FLARE, an active retrieval-augmented generation method to improve the reliability of LLMs; FLARE actively decides when and what to retrieve over the course of generation; demonstrates superior or competitive performance on long-form knowledge-intensive generation tasks - https://arxiv.org/abs/2305.06983
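A hedged sketch of the active-retrieval loop with placeholder `llm` and `retriever` callables; the signature (returning the next sentence plus its minimum token probability), the confidence threshold, and the looping scheme are my assumptions.

```python
# FLARE-style loop: generate sentence by sentence; when confidence drops,
# use the tentative sentence as a search query and regenerate with evidence.
def flare_generate(question, llm, retriever, max_sentences=10, tau=0.8):
    answer, context = "", ""
    for _ in range(max_sentences):
        sent, confidence = llm(question, context, answer)
        if not sent:
            break
        if confidence < tau:
            context = "\n".join(retriever(sent))   # retrieve on low confidence
            sent, _ = llm(question, context, answer)
        answer += " " + sent
    return answer.strip()
```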
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head - connects ChatGPT with audio foundational models to handle challenging audio tasks and a modality transformation interface to enable spoken dialogue-https://arxiv.org/abs/2304.12995
An Overview on Language Models: Recent Developments and Outlook - Provides an overview of language models covering recent developments and future directions; also covers topics like linguistic units, structures, training methods, evaluation, and applications - https://arxiv.org/abs/2303.05759
Eliciting Latent Predictions from Transformers with the Tuned Lens - Method for transformer interpretability that can trace a language model's predictions as they develop layer by layer - https://arxiv.org/abs/2303.08112
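A toy NumPy sketch of the probe: at each layer, a learned affine "translator" maps the hidden state toward the final representation space, and the model's own unembedding turns it into a next-token distribution. The random matrices below stand in for learned and frozen weights.

```python
import numpy as np

rng = np.random.default_rng(0)
d, vocab, n_layers = 64, 1000, 4
W_U = rng.normal(size=(d, vocab))                        # frozen unembedding
translators = [(np.eye(d) + 0.01 * rng.normal(size=(d, d)),
                np.zeros(d)) for _ in range(n_layers)]   # learned per layer

def layer_prediction(h, layer):
    # Affine translator per layer, then the model's own unembedding.
    A, b = translators[layer]
    logits = (h @ A + b) @ W_U
    probs = np.exp(logits - logits.max())
    return probs / probs.sum()

h = rng.normal(size=d)                   # hidden state at some layer
print(layer_prediction(h, 2).argmax())   # token this layer "predicts"
```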
Dreamix: Video Diffusion Models are General Video Editors - A diffusion model that performs text-based motion and appearance editing of general videos.
Rethinking with Retrieval: Faithful Large Language Model Inference - Shows the potential of enhancing LLMs by retrieving relevant external knowledge based on decomposed reasoning steps obtained through chain-of-thought prompting - https://arxiv.org/abs/2301.00303
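A short sketch of that idea with placeholder `llm` and `retriever` callables; decomposing the chain of thought by line splitting and all prompt wording are my assumptions.

```python
# Rethinking-with-retrieval sketch: decompose a chain-of-thought answer into
# steps, retrieve evidence per step, and revise steps to match the evidence.
def rethink_with_retrieval(question, llm, retriever):
    cot = llm(f"Q: {question}\nA: Let's think step by step.")
    revised = []
    for step in (s for s in cot.splitlines() if s.strip()):
        evidence = "\n".join(retriever(step))
        revised.append(llm(f"Step: {step}\nEvidence:\n{evidence}\n"
                           "Rewrite the step so it is faithful to the evidence:"))
    return llm("Combine these verified steps into a final answer to "
               f"'{question}':\n" + "\n".join(revised))
```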
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot - Presents a technique for compressing large language models without sacrificing performance; models can be "pruned to at least 50% sparsity in one-shot, without any retraining" - https://arxiv.org/abs/2301.00774
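To make the one-shot sparsification concrete, here is a deliberately simplified stand-in: plain per-layer magnitude pruning to 50% sparsity. SparseGPT itself uses a Hessian-based layer-wise solver that also updates the surviving weights, which this sketch does not attempt.

```python
import numpy as np

# Magnitude pruning: zero out the smallest-magnitude half of a weight matrix
# in one shot, with no retraining (a crude proxy for SparseGPT's solver).
def magnitude_prune(W, sparsity=0.5):
    k = int(W.size * sparsity)
    threshold = np.partition(np.abs(W).ravel(), k)[k]
    return np.where(np.abs(W) < threshold, 0.0, W)

W = np.random.default_rng(0).normal(size=(512, 512))
W_sparse = magnitude_prune(W)
print((W_sparse == 0).mean())  # ~0.5 sparsity
```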