TECH & OTHER NEWS

Researchers from Google AI and Tel-Aviv University Introduce PALP: A Novel Personalization Method that Allows Better Prompt Alignment of Text-to-Image Models

January 15, 2024

Researchers from Tel-Aviv University and Google Research introduced a new method of user-specific or personalized text-to-image conversion called Prompt-Aligned Personalization (PALP). Generating personalized images from text is a challenging task and requires the presence of diverse elements like specific location, style, or (/and) ambiance. Existing methods compromise personalization or prompt alignment. The most difficult challenge is to balance identity preservation and prompt alignment during personalized image generation, which often results in hindering the fulfillment of user prompts and subject fidelity.

Current text-to-image models struggle to control and align with specific prompts, requiring extensive prompt engineering and re-sampling. PALP focuses on single prompt generation by improving text alignment, which aligns the scene more efficiently as per user prompt. To prevent overfitting in the model, it uses pre-trained models’ knowledge and ensures the alignment with the prompt. It uses Score Distillation Sampling (SDS) techniques to guide the model’s prediction towards the target prompt.

PALP focuses on two main requirements for image generation: personalization and prompt alignment. Personalization fine-tunes a pre-trained model on a small set of images representing the target subject, updating specific network weights and optimizing a new word embedding for the subject. Prompt alignment prevents overfitting by using SDS to push the model’s noise prediction towards the target prompt. The method is compared with existing state-of-the-art methods like P+, NeTI, and TI+DB, and the results show that the proposed method was able to achieve the best text alignment while maintaining high image alignment. The model also showcased excellent results in qualitative evaluation with ten different complex prompts with at least four diverse elements.

PALP has addressed the challenges faced in text-to-image generation and provided the solution to the problem. PALP’s ability to generate images that align with complex and intricate prompts can be beneficial in multiple fields like content creation or on-demand image generation. The results in both qualitative and quantitative evaluations of the methods on different metrics show the possibility of improvement in personalized image generation.

Check out the Paper and Project Page. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter. Join our 36k+ ML SubReddit, 41k+ Facebook Community, Discord Channel, and LinkedIn Group.

If you like our work, you will love our newsletter..

Don’t Forget to join our Telegram Channel

Pragati Jhunjhunwala is a consulting intern at MarktechPost. She is currently pursuing her B.Tech from the Indian Institute of Technology(IIT), Kharagpur. She is a tech enthusiast and has a keen interest in the scope of software and data science applications. She is always reading about the developments in different field of AI and ML.

[Free AI Event] 🐝 ‘Real-Time AI with Kafka and Streaming Data Analytics’ (Jan 15 2024, 10 am PST)

Source Link

Researchers from Google AI and Tel-Aviv University Introduce PALP: A Novel Personalization Method that Allows Better Prompt Alignment of Text-to-Image Models

LEAVE A REPLY Cancel reply

TECH NEWS

Everything Old is New Again: AI-Driven Development and Open Source

Gen AI in Healthcare: The State of Affairs in India

Gartner Predicts Legal, Risk and Compliance Functions to Double Technology Spend...

Microsoft to End Support for Windows Mail, Calendar and People Apps...

IDC Predicts: Asia/Pacific Business Leaders to Demand 80% Success Rate on...

The Cooling Conundrum: AI and Automation Push Data Centers Toward 3X...

TOP STORIES

Seventy Percent of Economies Are Underprepared for AI Disruption

New study shows almost half of tech professionals in India believe...

Organizations Combining Organizational Learning and AI-Specific Learning Are up to 80%...

Nvidia’s AI-driven triumph over Intel powered by strategic innovations

Most banks and insurers adopt cloud solutions with the primary objective...

India’s Web3 Ecosystem Has Over 400 Firms, Karnataka Emerges as Industry...

Cyber Security

AI and Gen AI are set to transform cybersecurity for most...

ThreatQuotient Publishes 2024 Evolution of Cybersecurity Automation Adoption Research Report

Kaspersky predicts quantum-proof ransomware and advancements in mobile financial cyberthreats in...

Rising concerns, lingering gaps: most organizations fear AI-driven cyberattacks but lack...

Tenable Forecasts Data Security in the Cloud to Take Centre Stage...

Blockchain-Enhanced Cybersecurity-Safeguarding Digital Identities and Data