FLUX Erase vs FLUX VTO — 新たな画像編集技術の比較
FLUX Eraseは、迅速かつ低コストで画像編集を実現する新技術です。過去のFLUX VTOと比較すると、特にオブジェクト消去の精度や速度において優位性を持ち、複雑な背景に対応しています。
65 articles in "新機能" — sorted by importance and recency
FLUX Eraseは、迅速かつ低コストで画像編集を実現する新技術です。過去のFLUX VTOと比較すると、特にオブジェクト消去の精度や速度において優位性を持ち、複雑な背景に対応しています。
FLUX VTO: Virtual Try-On at Scale
FLUX Virtual Try-On allows users to try on outfits before purchase with high accuracy. It generates results in under four seconds at low cost, supporting large catalogs. It aims to overcome traditiona
How Envato Built Its Creative AI Engine on FLUX
Envato rethought its creative platform to create a unified toolkit for generating, editing, and personalizing visual content using FLUX models. This led to a seamless experience for users and made FLU
FLUX Erase: Remove anything, leave no trace
FLUX Erase is a technology that removes masked objects, shadows, and reflections, reconstructing the surrounding scene. It outperforms traditional methods by being faster and more cost-effective, demo
FLUX.1 Kontext now in Adobe Photoshop: Powering Every Pixel
FLUX.1 Kontext [Pro] integrates with Adobe Photoshop, allowing creators to push their imagination further without juggling apps and files. Users can select the model, describe edits easily, and refine
OpenArt transforms video storytelling with FLUX.1 Kontext
OpenArt has transformed video storytelling by integrating FLUX.1 Kontext, addressing visual consistency challenges and making professional-quality content accessible to both expert and novice creators
FLUX Outpainting: Seamlessly Extend Any Image in Any Direction
FLUX Outpainting is a tool that extends images beyond their original frame seamlessly. Traditional outpainting tools often produce visible seams and broken lighting, but FLUX addresses these issues. U
Introducing Claude Science, an AI workbench for scientists
Anthropic has launched Claude Science, an AI workbench designed for scientists. This app integrates commonly used tools and packages, produces auditable artifacts, and provides flexible access to comp
New Features and Fixes in Claude Code v2.1.196
The new version v2.1.196 of Claude Code has been released, adding support for organization default models, improving session name readability, and enabling clickable file attachments in chat. It also
Introducing computer use in Gemini 3.5 Flash
Google DeepMind announced the introduction of computer use in Gemini 3.5 Flash, enhancing user interaction with AI for more efficient tasks. This suggests a broader application of AI, especially in pr
Introducing Claude Tag
Anthropic has introduced a new collaboration tool called 'Claude Tag'. This tool functions as a team member on Slack, allowing users to assign tasks and track progress. Claude remembers information fr
OpenAI Unveils New Daybreak Security Tools for Organizations
OpenAI has introduced new Daybreak tools, including Codex Security and GPT-5.5-Cyber, to help organizations find, validate, and patch vulnerabilities at scale, enhancing their security measures.
Introducing Luma Skills: Build a Creative Workflow Once, Run It Forever
Luma Labs introduces 'Luma Skills', a repeatable creative workflow within Luma Agents. This allows teams to create a workflow once and reuse it for different assets, achieving consistent high-quality
Improving health intelligence in ChatGPT
GPT-5.5 Instant enhances ChatGPT's health and wellness responses with stronger reasoning, better context, clearer communication, and physician-informed evaluations, enabling users to receive more reli
Introducing Deployment Simulation for Predicting Model Behavior
OpenAI has introduced 'Deployment Simulation,' a method to predict AI model behavior before deployment using real conversation data, aimed at improving safety and evaluation accuracy. This allows for
Runwayの新しい動画生成システムGen-2と、過去のモデルGen-4.5を比較しました。Gen-2はテキストや画像から動画を生成する新機能を搭載し、ユーザーの表現力を広げることが期待されます。一方、Gen-4.5は視覚的忠実度やプロンプト適合性を強化しています。両者の特徴を比較し、今後の展望を探ります。
Runwayの新モデル「Gen-4.5」と過去の「GWM-1」を比較します。Gen-4.5はビデオ生成に特化し、動きの品質や視覚的忠実度を向上させました。一方、GWM-1はリアルタイムシミュレーションに焦点を当てています。それぞれの特徴を見ていきましょう。
Runway Research | Gen-2: Generate novel videos with text, images or video clips
Runway has announced Gen-2, a new multimodal AI system that generates novel videos from text, images, or video clips. It synthesizes realistic and consistent new videos by applying styles from images
Runway Research | Scale, Speed and Stepping Stones: The path to Gen-2
Anastasis Germanidis, CTO and co-founder of Runway, discusses the development journey of Gen-2, a text-to-video system that allows direct text-guided video generation without structural conditioning.
Runway Unveils New Features for Enhanced Video Generation Control
Runway has announced new features that enhance control, fidelity, and style expression in video generation. Notably, the introduction of Motion Brush allows users to direct specific movements with a s
Runway Gen-3 Alpha: Next-Generation AI Video Generation
Runway's newly announced Gen-3 Alpha is the first next-generation foundation model trained on a new infrastructure for large-scale multimodal training. It shows significant improvements in fidelity, c
Runway Research | Introducing Frames
Runway has introduced a new image generation model called 'Frames'. This model excels at maintaining stylistic consistency while allowing for broad creative exploration. Users can set a specific look
Evolution of Creative Interfaces in the Age of Generative Media
Runway's latest prototype introduces a new interface design enabling creative exploration through the latent space of generative models. Users can treat images as nodes and construct non-linear timeli
Runway Unveils a New Frontier for Video Generation with Gen-4.5
Runway has introduced its new video generation model, 'Runway Gen-4.5.' This model achieves state-of-the-art motion quality, prompt adherence, and visual fidelity, opening up new possibilities in vide
Runway Introduces GWM-1: A State-of-the-Art General World Model
Runway has announced GWM-1, a state-of-the-art General World Model designed to simulate reality in real time. This model is interactive, controllable, and general-purpose, offering new possibilities a
How Real-Time Video Generation Is Changing Online Interaction
For most of internet history, users typed text to get results. This is changing as real-time video generation will offer a more interactive model, responding to user inputs live. This technology is ex
新たに発表されたDiffusionGemmaは、テキスト生成速度を4倍に向上させる技術です。一方、Gemma 4 12Bは視覚とテキストを同時に処理できるエンコーダーなしのマルチモーダルモデルです。両者は異なる用途に特化しており、AIの進化に貢献しています。
DiffusionGemma: 4x Faster Text Generation
Google DeepMind's new DiffusionGemma technology enhances text generation speed by four times. This advancement is expected to make AI text generation faster and more efficient, potentially impacting r
Fluid, natural voice translation with Gemini 3.5 Live Translate
Gemini 3.5 Live Translate offers near real-time, natural speech translation across Google AI Studio, Google Translate, and Google Meet, enabling smoother conversations and enhancing international comm
Introducing Gemma 4 12B: a unified, encoder-free multimodal model
Google DeepMind has announced a new unified multimodal model called Gemma 4 12B. This model operates without an encoder, capable of simultaneously processing multiple data types such as vision and tex
What Codex unlocks for Notion
Notion demonstrates how it uses Codex to one-shot specs, build AI Voice Input for the web, and enhance engineering power for small teams, marking a significant step in improving operational efficiency
ChatGPT Introduces Enhanced Memory for Improved User Experience
ChatGPT introduces a new memory system aimed at better remembering user preferences and keeping context fresh and relevant across conversations. This feature is expected to enhance interactions, allow
Introducing new capabilities to GPT-Rosalind
GPT-Rosalind has introduced new capabilities to support life sciences research. Enhanced biological reasoning, medicinal chemistry expertise, genomics analysis, and experimental workflow capabilities
How Wasmer used Codex to build a Node.js runtime for the edge
Wasmer utilized GPT-5.5 with Codex to build a Node.js runtime for edge computing, accelerating development by 10x to 20x and enabling shipping in weeks instead of months. This advancement significantl
Codex for every role, tool, and workflow
OpenAI has unveiled new Codex plugins and annotations that help analysts, marketers, designers, and investors enhance productivity with AI. These features provide tools tailored for various roles, ass
New Features in Claude Code v2.1.158
Version v2.1.158 of Claude Code has been released, enabling Auto mode for Opus 4.7 and Opus 4.8 on Bedrock, Vertex, and Foundry. Users can opt in by setting the environment variable `CLAUDE_CODE_ENABL
Strengthening societal resilience with Rosalind Biodefense
OpenAI launches Rosalind Biodefense, expanding trusted access to GPT-Rosalind for vetted developers and U.S. government partners advancing biodefense, public health, and pandemic preparedness through
New Features and Improvements in Claude Code v2.1.154
Anthropic's Claude Code v2.1.154 has been released with many new features. Notably, dynamic workflows allow users to orchestrate work across multiple agents for larger tasks. Additionally, a cost-effe
Introducing Claude Opus 4.8
Anthropic has introduced its new model, Claude Opus 4.8, which offers stronger performance in coding and agentic tasks with consistency for long-running work. Building on Opus 4.7, it includes feature
DiffusionBlocks: Training Neural Networks One Block at a Time
Anthropic has introduced a new method called 'DiffusionBlocks' that allows training neural networks block by block, enabling more efficient learning. This approach is expected to improve performance,
Stable Audio 3.0 Released: A Family of Open-Weight Music Models
Stability AI has announced Stable Audio 3.0, a family of open-weight music models trained on fully licensed data. Users can freely distribute and commercialize their outputs, with four models availabl
Release of anthropic-sdk-python v0.115.0
Anthropic has released a new version of its SDK, v0.115.0. This update adds support for Managed Agents event delta streaming, agent overrides, reverse pagination, vault credential injection scoping, a
anthropic-sdk-typescript: Release v0.109.0
Anthropic has released version 0.109.0 of its SDK. This update adds support for Managed Agents event delta streaming, agent overrides, reverse pagination, vault credential injection scoping, and agent
Anthropic SDK v0.32.0 Released with Logger Support for AWS Credential Provider
The Anthropic SDK has been updated to version 0.32.0, adding the feature to pass a client logger to the AWS credential provider chain. This new functionality enhances developers' ability to manage AWS
Our latest Google Finance upgrades, including a new app
Google Finance has announced its latest upgrades, including a new app. This allows users to access a more user-friendly interface and new features. Notably, managing and analyzing financial data is st
Release of anthropic-sdk-python v0.112.0
Anthropic has released version 0.112.0 of its SDK. This update includes support for system.message streaming events, fixes a bug in the memory tool, adds support for a new refusal category, and allows
Release Notes for Claude Code v2.1.187
Anthropic has released an update for Claude Code, version 2.1.187. This update enhances sandbox settings, reflects organization-specific model restrictions in the model picker, and includes several bu
Codex-maxxing for long-running work
Jason Liu explores how to use Codex to preserve context and manage complex projects, enabling work to continue beyond a single prompt, ultimately improving productivity.
New Features and Fixes in Claude Code v2.1.183
The latest version of Claude Code, v2.1.183, enhances safety in auto mode by blocking certain destructive git commands. A warning feature for deprecated models and improved toggle behavior for setting
New usage analytics and updated spend controls for enterprises
OpenAI introduces new spend controls and usage analytics for ChatGPT Enterprise, enabling organizations to manage costs and scale AI with confidence. These features are expected to support companies i
New Features and Improvements in Claude Code v2.1.181
Claude Code has released version v2.1.181, introducing a new syntax `/config key=value` for interactive setting changes and an opt-in setting to allow Apple Events in sandboxed commands on macOS. Stre
Introducing LifeSciBench
LifeSciBench is an expert-authored and reviewed benchmark designed to evaluate how AI systems handle real-world life science research tasks and decisions. This benchmark is significant for measuring t
New Features and Improvements in Claude Code v2.1.178
Anthropic's Claude Code has been updated to v2.1.178, introducing new syntax for permission rules based on tool parameters, improvements to nested skill directories, and clearer error messages. These
New Features in Claude Code v2.1.175
The new version v2.1.175 of Claude Code has been released. This update adds the `enforceAvailableModels` managed setting, which impacts the Default model based on the `availableModels` allowlist. If a
How Preply combines AI and human tutors to personalize learning
Preply uses OpenAI technology to provide AI-generated lesson summaries, enabling personalized feedback and language learning exercises. This combination of AI and human tutors enhances the learning ex
Key Updates in Claude Code v2.1.169
Claude Code version 2.1.169 has been released with new features like the `--safe-mode` flag for troubleshooting and the `/cd` command for moving to a new working directory. Numerous UI improvements an
New Features and Fixes in Claude Code v2.1.166
Anthropic's Claude Code has been updated to v2.1.166, introducing new features and bug fixes. A new `fallbackModel` setting allows up to three fallback models to be tried in order when the primary mod
Release Notes for Claude Code v2.1.163
Claude Code has been updated to v2.1.163. New features include managed settings for version restrictions and a command to list installed plugins. The update also includes various bug fixes for Bash co
Travelers deploys AI-powered claims countrywide with OpenAI
Travelers has built an AI-powered Claim Assistant in collaboration with OpenAI. This system guides customers through the claims filing process, offers 24/7 support, and is designed to scale operations
Claude Code v2.1.160 Release Notes
The new version v2.1.160 of Claude Code has been released, featuring security enhancements through added prompts and various bug fixes. Notably, prompts before writing shell startup and build-tool con
11 demos of Gemini Omni and Gemini 3.5 in action
New demos of Gemini Omni and Gemini 3.5 have been released, showcasing practical applications of AI. These demos help users understand new features and performance, making them particularly important
Release Notes for Claude Code v2.1.153
The latest version of Claude Code, v2.1.153, has been released with various feature enhancements and bug fixes. A new `skipLfs` option allows skipping Git LFS downloads during clone. A one-time notice
Claude Code Release v2.1.152 Overview
The new version v2.1.152 of Claude Code has been released, featuring improvements to code review and skill management functions. Notably, the '/code-review --fix' command can now apply review findings
Claude Code v2.1.147 Release Changes
Anthropic has released Claude Code version 2.1.147. This version introduces a new Workflow feature, renames commands, and enhances security for REPL and Workflow tools. It also fixes bugs related to e
Release of anthropic-sdk-python v0.104.0
The new version v0.104.0 of the Anthropic SDK has been released, adding a beta feature for thinking-token-count that supports estimated tokens in thinking block deltas during streaming. This release i